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electron concentration, and the Fe distribution coefficient. The 
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on the sintered density and magnetic properties of this alloy. An 
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a 24-wire-dot matrix printer. This printer operated at a printing 
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Analysis of Kanji Structures 

and a Method for Recognizing 

On-Line Handwritten Kanji Characters 

by Means of Stroke Representative Points 


@ Yasuo Ishii 


(Manuscript received February 27, 1987) 


This paper shows how structural information with a 99.99 percent stability rate can be 


extracted from handwritten kanji characters. 


It also shows how kanji patterns can be resolved stably. A new on-line kanji character 
recognition method achieved a success ratio of 99.5 percent in recognizing kanji characters 


written in an unspecified style. 


1. Introduction 

The recognition of handwritten kanji charac- 
ters is the focus of character recognition 
research in Japan. 

The number of kanji categories to be rec- 
ognized is very much larger than the number 
of alphabetic categories. 

For example, the Japanese Industrial 
Standards (JIS) Committee has created a 
standard set of kanji characters”. This set 
contains 6349 categories which are divided 
into two levels. The 2965 JIS level-1 characters 
form the basic set. The remaining 3 384 kanji 
characters are termed JIS level-2 characters. 

The great number of kanji categories in 
wide use is one of the difficulties of kanji 
character recognition. Generally speaking, the 
structure of kanji is quite complex. A complex 
kanji character is a combination of several 
basic patterns. Replacing one of the several 
basic patterns with another makes another kanji 
character. For example, replacing a part of the 
kanji character ## turns it into the kanji char- 
acter "E 

The above example shows that not only 
are the structures of kanji characters complex, 
but also some kanji characters closely resemble 
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each other. This is the second reason why 
kanji character recognition is difficult. In the 
case of handwritten kanji character recognition, 
individual handwriting styles show significant 
differences in the patterns. This is the third 
reason why recognition is difficult. 

Character recognition methods are classified 
as on-line character recognition and off-line 
character recognition. The former is usually 
easier to implement. For kanji character recogni- 
tion, better results have been reported for 
on-line recognition than for off-line recognition. 

For example, in on-line handwritten kanji 
character recognition in which both the stroke 
order and the number of strokes are correct, 
results with a recognition ratio of 99.7 percent 
were achieved by Odaka et al., in 1980, using 
mixed texts of kana and kanji characters”), 

Results were also reported for the case of 
unrestricted stroke order by Odaka et al.” in 
1982: for the case of unrestricted stroke order 
and number of strokes by Wakahara and 
Odaka® in 1983; and for the case of cursive 
characters by Sato and Ichihara” in 1983, and 
by Wakahara and Umeda” in 1984. 

As shown by these achievements, restrictions 
on the stroke order and number of strokes for 
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on-line handwritten kanji character recognition 
have been lifted. However, this was only 
achieved by registering in a dictionary as many 
different patterns as required for different 
writers, and by using a dynamic programming 
matching method to match the dictionary 
to the input pattern. As a result, both the 
dictionary size and matching time are increased. 

We chose to examine what would happen 
if kanji character patterns contained stable 
information that is only dependent on the 
pattern shape, regardless of differences among 
writers. 

If stable information can be extracted, then 
the use of it could free kanji character recogni- 
tion from stroke order restrictions. In addition, 
the dictionary size would decrease and the time 
taken to match the dictionary and input pattern 
could also be cut down. 

This paper describes the analysis of kanji 
structures using this approach and the results 
obtained from recognition experiments. 

Chapter 2 of this paper describes the matrix 
representation of kanji structures and the 
experimental way in which information was 
obtained about stable kanji characters. Chapter 3 
describes the methods used to resolve kanji 
structures. Chapter4 describes kanji recogni- 
tion experiments based on the methods used 
to resolve kanji structures. 


2. Structure of kanji characters and matrix 
representation of characters 
2.1 Structure of kanji characters 

The basic element of a kanji character 
is the stroke. 

The two simplest kanji characters (‘‘ichi”’ 
meaning ‘“‘one’’, and ‘“‘otsu’’, meaning ‘“‘the 
second or the latter’) and the most complex 
kanji character (“‘noh’, meaning “a stopped 
nose”) were selected from a medium-sized 
Chinese-Japanese dictionary® and are shown in 
Fig. 1. 

If a kanji character contains more than 
two strokes, its structure can be represented 
by all the positional relations of all its pairs 
of strokes. 


5] 
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Be 
TR 


Ichi Otsu Noh 


Fig. 1—Simplest kanji characters and a complex kanji 
character. 


The strokes of a kanji character are 
numbered to correspond to their correct stroke 
order. Assume that the positional relation of 
the i-th stroke (S;) and the j-th stroke (S;) is 
Gj (the exact definition of Gj will be given 
later). We call the matrix representation of 
the structure of this kanji character, | Gj |, 
its structure matrix. 

Now, let us define Gj, that is, the positional 
relation between strokes S; and S;. Kato et al.”) 
defined one point for each stroke, and called 
it the representative point of the stroke. It is 
not easy to precisely define the positional 
relations of strokes. However, the positional 
relations between representative points can be 
easily defined. For this reason, Kato et al. 
devised and used the latter instead of the 
former. Although representation varies some- 
what in this paper, the essential theory of 
definitions follows that of Kato et al. 

Assume that the representative points of 
the two strokes S; and S; and are D; and D,;, 
respectively, and their coordinates are (X;, 
Y;) and (X;, Y;). 

In this paper, four representative points 
were considered for strokes; namely, the Begin- 
ning Point (B), Ending Point (E), Midpoint 
(M), and the Center of Gravity (G). However, 
the discussion presented below is independent 
of the type of representative point used. 

The relative positional relations Gj between 
the representative points D; and Dj; is rep- 
resented as a two-digit value. The first digit 
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represents the left-to-right relation, while the 
second digit represents the top-to-bottom 
relation. 

Assume the value of the first digit of Gj to 
be L if D; is not located to the right of Dj, 
that is, if X; <X;; and assume the value of 
the first digit of Gj; to be R if Dj is located 
to the right of D;, that is, if X; > X;. 

Assume the value of the second digit of Gj 
to be U if D; is not located below Dj, that is, 
if Y; 2 Y;; and assume the value of the second 
digit of Gj; to be D if D; is located below D;, 
that is, if Y; < Y;. 

Summarizing the foregoing, if i#/ 


Ga=LOlEDIRG|RD,  scseus (1) 
where, if i =/ 
=. = #* 
ij : 
The difference between kanji categories 


is not considered in the above notation. To 
distinguish between categories, the symbol 
representing the category is added to the right 
shoulder. For example, the i-th stroke of 
category k is SF, and its representative point 
is Df. 


2.2 Stability of structure matrix 

Kato et al.” stated that the relative posi- 
tional relation between the _ representative 
points of the strokes is stable. Therefore, we 
will make structure matrices | Gy | for many 
handwritten patterns to check their stability. 

2.2.1 Definition of stability 

Select a pattern from an arbitrary category 
k, and observe the relative positional relation 
Gj; between its two strokes S; and S;, and the 
value of its first digit is either L or R based 
on the definition. 

For m patterns selected from category 
k, assume the number of patterns for which 
the value of the first digit of Gj was L to be /, 
and the number of patterns for which the value 
of the first digit of Gj was R, to be r. Then, of 
course,/+r=m. 

We define the horizontal stability of S; and 
S; as follows: 


KM (K, 7) = =r im, 
it is obvious that 
OX Ss 1. 


If X =1, then /=m or r — m, which means 
that the value of the first digit of Gj are L or 


Table 1. A 200 kanji categories in common use, taken 
from the beginning of the JIS level-1 code 
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Table 2. All the ten-stroke kanji categories in common use (203 categories) 


wt 


Be 
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Ww 


et = BF 


ke 


te ot «Ft Mi tt 
mh ras} 
- # FS F Mw Bk htt 


# 


ny) 


a RR RK HK BE 


MOS 
ct 


FER i 


iil 


ii 
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R, respectively. Further, if X =0, then /=r, 
which means that half of the values of the first 
digit of Gj are L and the remainder are R. 

Similary, let us define the vertical stability 
direction of S; and S; as follows: 


Y (k,i,j) = |u-—d|/m, 


where u is the number of patterns for which 
the value of the second digit of Gj is U, and 
d is the number of patterns for which the value 
of the second digit of Gj is D. 

Then, obviously, 


ef tar eeere manta te (4) 
and 
OA FR1  <teasssicegecess (5) 
2.2.2 Pattern collections for stability experi- 
ments 
On-line handwritten printed-style kanji 


patterns are collected. It must be possible 
to compare kanji patterns which belong to 
the same category easily and positively. To 
ensure this, only patterns with the correct 
number of strokes and the correct stroke order 


were adopted. To maintain consistency in 
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the experimental data, all categories were 
written by a specific group of writers. Forty 
writers in their twenties and early thirties 
were selected, including four women. 

Experimental patterns must be in a small 
set which is representative of all kanji characters. 
For this purpose, two unique sets were used. 
The categories contained in each of these sets 
are shown in Tables | and 2. 

Each of these sets contains kanji categories 
that contain kanji characters for personal names 
and common kanji characters. Table 1 shows 
the first 200 categories as arranged in the 
order of the JIS level-1 codes, while Table 2 
shows 203 categories of kanji characters with 
ten strokes. 

Table 1 features kanji characters with 
different numbers of strokes. Table 2 features 
a maximum of kanji characters having the 
same number of strokes. 

A data collector as shown in Fig. 2 was 
used to obtain the required stroke patterns. 
This device consists of a tablet for recording 
on-line handwritten patterns, connected to 
a FACOM FM-8 personal computer. 
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1) Data was collected as follows: ‘‘A guide to 
writing kanji patterns (with their stroke 
order)’ was shown to all persons taking part 
in the experiment (see Fig. 3). 


write the kanji patterns with the correct 
number of strokes and using the correct 
stroke order. Please rewrite if you make a 
mistake.” 


2) The following request was made: ‘‘Please 3) When data collection was complete, all 
patterns were reproduced using a data check 
CRT system, and the data collected from the 
FM-8 
Fig. 2—Kanji pattern collector. Fig. 3—Sample of a guide to write kanji patterns. 
fae |= |] | = || SS ae lle 
= ie eo I = ‘a | | Reef | catia el we. |) 
QO AD AR | 7» (pa (De | aa) AD Ap || no 
f = 71 Ee f 
2 iiss 2/6 | |Z ||# je | Bie 
Bo 5 AD |liag || 79 |S |i |lve || || Fg 
| [te ee | ees a5. | | ieee. ol Pee Et 
S| 2. ie a Se iZ|/¢2/B12 ig 
Fe || 0 | sa || Po so || WO || 79 laa || He || Fo | 
| |e || a || ey] | | Se) ae ae 
S|B2il¢4leZl/S$ lSalal 2] sl 
Fo || FO || oe || 9 || Fa || Wo || AO || We || wo a) 
A 13 a se [cl”p |-iA ] Sa! bed = ~ ie 
Al | 14! | ote] | sel rey yo} ff Te) 718} |=) |) 42) 
re] \ ‘a ay \r Masel Neat x gs! ~) 
=\4 ~ \ a v { a a, . 
“1 W216) || Vi /\o digs ss\ || Mal |) 7] 7 i] Peel 
fics, J - i _id IF) eo ari) \ cia =| -\-) 
4 ley | [aate 7ta} || > Yo} a] slal |e) lle 


sia 
Aol 
= = i \JA peed |i 


fi r 
me ee ee a 2 
L 74 8 || 7 


Fig. 4—Sample of collected kanji patterns. 
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forty writers were checked visually. 

4) Whenever incorrect character 
detected, it was rewritten by its original 
writer. 

The shapes of some of the collected patterns 
are shown in Fig. 4. 

The individual pattern data consists of the 
kanji character code, writer number (0 through 
39), and stroke codes arranged in stroke order. 
Stroke codes are sets of point coordinates 
on a two-dimensional coordinate system based 
on a combination of time sampling (once 
every 20 ms) and shifting distance sampling 
(shifts of less than 0.3 mm were ignored), and 
the accuracy of the codes is 16 bits. 

For the experiments, the strokes of the 
originally collected kanji patterns were reedited 
into sets of several points of equal intervals. 

Forty patterns were collected for each 
category, all of which were printed-style kanji 


an was 


characters, handwritten on-line patterns with 
the correct number of strokes and in the correct 
writing order for use as experimental data. 

A total of 403 categories (200 categories 
in the first set and 203 in the second set) were 
collected, or 16 120 patterns in all. 

2.2.3 Stability experiment 

Experiments were made on the stability 
of the positional relations of representative 
points of strokes, using the patterns written 
by 40 writers on the 200 categories of kanji 
characters shown in Table 1 (200 x 40 = 8000 
patterns). 

Two of the 200 categories were excluded 
because they contain only kanji characters 
of one stroke, and the remaining 198 categories 
were considered. If two arbitrary strokes are 
considered as a pair of strokes within the charac- 
ter group, then a total of 10412 pairs can be 
made. 


Table 3. Frequency distribution table showing positional stabilities of each representative point 


Beginning point Ending point Midpoint Center of gravity 
(B) (E) (M) (G) 
Stability 
(A) x y x y x y x y 
0 61 30 42 43 62 29 43 31 
0.05 117 60 94 124 123 78 104 49 
0.10 Li2 66 102 105 109 719 OF 68 
0.15 108 61 83 110 107 54 88 52 
0.20 118 82 98 122, Lg 73 108 64 
0.25 126 79 88 ily Ib 106 68 96 62 
0.30 128 82 94 93 114 61 116 68 
0.35 123 84 106 128 139 74 97 75 
0.40 124 91 97 107 127 72 122 63 
0.45 145 92 113. 129 108 15 [ 127 67 
0.50 152 101 102 129 107 97 121 93 
0.55 150 85 108 130 97 92 102 90 
0.60 158 121 116 121 116 103 127 87 
0.65 173 93 114 127 119 Vz 139 112 
9.70 | 173 116 120 | 136 110 97 147 124 
0.75 180 148 172 130 138 124 160 127 
0.80 190 180 196 164 153 162 201 157 
0.85 255 192 215 172 223 169 233 gall 
0.90 320 252 350 281 280 237 313 246 
0.95 481 374 553 426 487 380 494 400 
1.00 7018 8 023 7449 7518 7 468 8171 | 7377 8 206 
Total 10 412 10 412 10 412 10 412 10 412 10 412 10 412 10 412 
Mean 0.868 1 0.914 5 0.895 1 0.883 3 0.883 1 0.9196 0.887 6 0.924 8 
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Since 40patterns are prepared for every 
category, 40 relative positional relations can be 
observed for each pair and stability can be 
calculated for the pair. — 

The Beginning point (B), Ending point (E), 
Midpoint (M), and the center of Gravity (G) 
were selected as representative points for the 
strokes. For each type of point the following 
experiment was carried out: The horizontal 
stability (X) and vertical stability (Y) were 
calculated for 10412 sets of strokes, and the 
frequency distribution for every 0.05 was 
obtained for the stability range from 0 to 1. 
Results are shown in Table 3. 

Table 3. shows the overall stability of the 
relative positional relations between representa- 
tive points. The mean stability is between 0.87 
and 0.92 and, therefore, can be said to be 
stable as a whole. However, the relative 
positional relations with stability of 1.00 (that 
is, those virtually without individual differences) 
were only 67.4 percent to 78.8 percent. There- 
fore, the positional relations are significantly 
affected by individual differences, which seems 
to prevent their use as stable structural informa- 
tion. 

Kato et al. avoided this problem by register- 
ing individual information, but this is not 
a desirable solution. Therefore, we will examine 
possible causes of instability of the positional 
relations between strokes to determine whether 


Fig. 5—Kanji pattern we, 


note: Henceforth, we will refer to the representative 
points by capital letters, as follows. (B): beginning 
point, (E): ending point, (M): midpoint, and 
(G): the center of gravity. 
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instability is caused by differences among 
writers or is due to the method used for 
observing the structural information. 

2.2.3 Review of the experiment 

Table 3 shows that neither the horizontal 
stability nor the vertical stability is stable 
enough. 

However, let us examine Fig. 5. The top-to- 

bottom relations between strokes a and b, 
b and c, b and d, c and e, and d and e are all 
stable. 
n addition, the left-to-right relation be- 
tween c and d is stable. This means that these 
types of relations are not apt to be affected 
by individual differences. However, the left-to- 
right relation between a and b and the top- 
to-bottom relation between c and d may be 
significantly affected by individual differences. 


— 


Therefore, we simply select either the 


Table 4. Frequency distribution table showing positional 
stability A of each representative point 


A B E M G 
0 0 0 1 0 
0.05 0 0 3 0 
0.10 0 3 1 0 
0.15 0 1 1 0 
0.20 0 2 0 0 
0.25 0 9 4 0 
0.30 0 5 1 4 
0.35 2 13 2 4 
0.40 3 14 4 1 
0.45 6 11 q 3 
0.50 9 14 3 9 
0.55 12 15 5 8 
0.60 ig 27 7 6 
0.65 26 ligt 4 11 
0.70 21 16 7 8 
0.75 33 19 6 13 
0.80 42 28 8 7 
0.85 25 23 20 12 
0.90 70 78 27 22 
0.95 116 155 33 61 
1.00 10 000 9 968 10 246 10 243 
Total 10 412 10 412 10 412 10 412 
Mean 0.992 7 0.990 3 0.996 3 0.996 5 
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horizontal stability (X) or vertical stability ( Y) 
— whichever is better — and define this to be the 
stability of the two strokes. 

That is, assuming the stability of the two 
strokes Sk and Sk to be A (k,i,/), 


AUF) = max 1 XG), G.I}. 
ee (6) 


Using this new definition, the frequency 
distribution of positional stability (A) for 
the same data shown in Table 3 is calculated 
as shown in Table 4. 

The average of stabilities of the A distribu- 
tion is more than 0.99, and the percentage 
of those with a stability of 1.00 is 98.41 percent 
for M, 98.38 percent for G, 96.04 percent 
for B, and 95.74 percent for E, respectively. 

Although — stabilities vary according to 
the type of representative point, it is apparent 
that the percentage of those with a stability 
of 1.00 has been improved significantly and 
that structural information is more stable than 
that shown in Table 3. 

To be more specific: 

Stable observation can be performed by 
merely observing X if the relative positional 
relation between the i-th stroke and the j-th 
stroke of a given kanji structure seems intuitive- 
ly to be in the left-to-right relation, and by 
merely observing Y if the relative positional 
relation between the i-th stroke and the j-th 
stroke of a given kanji structure seems intuitive- 
ly to be in the top-to-bottom relation. 

In this way, by limiting the direction of 
observation for each set of kanji strokes to 
either left-to-right or top-to-bottom, and 
through more stable observation, the observed 
stability of Gk is increased. 

Therefore, the instability of the relative 
positional relations between the representative 
points shown in Table 3 is due to the method 
used to observe the information, rather than 
due to individual differences. 

From Table 4, however, it is obvious that 
none of the representative points is sufficiently 
stable and that the stability varies for different 
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types of representative points. 

Next, we will extract cases having lower 
stability and analyze them for each representa- 
tive point. 

2.2.4 Analysis of cases with low stability 

For each of the four types of representative 
points, ten A’s (k,i,7/) were output in ascending 
order of stability. This was done to determine 
which stroke pairs in which categories have 
low stability in the positional relation. 

The actual characters contained in category 
k and the shapes of combinations of S¥ and Sk 
were checked and shown in Tables 5 through 8. 

For example, if we select the characters 
having low Midpoint stability from Table 4, 
there are ten characters ranging from A =0 to 
A =0.25. 

The details of these ten low stabilities are 
shown in Table 5. 

In Table 4, similarly, the ten low stabilities 
of G are distributed from A = 0.30 to A = 0.45, 
in ascending order, and their details are as shown 
in Table 6. These are the same for both B and E. 

The following conclusions can be drawn 
from checking Tables 5 through 8: 

1) Unstable midpoint factors 

Table 5 shows that relative positional 
relations become unstable when the two strokes 
intersect and when the midpoints are close to 
each other. 

2) Unstable center of gravity factors 

From Table 6, the’ relative positional 
relations become unstable when the two strokes 
intersect and when the centers of gravity are 
close to each other, such as in the relations 
between the 6th and 10th strokes of the kanji 
character .§) and the 3rd and 7th strokes of the 
kanji characters FY. 

3) Unstable beginning point factors 

Table 7 shows that the relative positional 
relation becomes unstable when the beginning 
points of the two strokes are close to each other. 
4) Unstable ending point factors 

Table 8 shows that the relative positional 
relation becomes unstable when the ending 
points of the two strokes are close to each other. 

Therefore, regardless of the type of the 
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Table 5S. Unstable portions in the case of the midpoint 


— Unstable pairs of strokes 

nstabdie 

AofM aharsckex —— Stroke number 

i j 

0 [ea] } 3 4 
0.05 * + 1 2 
0.05 3 + 1 2 
0.05 fi + 5 6 
0.10 wife + 7 9 
0.15 #E + 7 8 
0.20 5 + 4 5 
0.25 be + 7 8 
0.25 {ty + 4 5 
0.25 fia] + 9 10 


Table 6. Unstable portions in the case of the center of 


gravity 
e ‘ Unstable pairs of strokes 
nstable 
AofG dhadietor Rdjern Stroke num ber 

i ] 

0.30 ye x 2 3 
0.30 ER TJ 6 10 
0.30 ‘He + 8 9 
0.30 ibe + 1 2 
0.35 ca + 1 2 
0.35 SAL P 6 7 
0.35 tE + 3 4 
0.35 FE 4: 6 A 
0.40 VE | 6 7 
0.45 SBI * 3 7 


Table 7. Unstable portions in the case of the beginning 


point 
Unstable pairs of strokes 
A of B biedira ae Stroke number 

i j 

0.35 He | fa rs “| 7 
0.35 HE ie 10 12 
0.40 He n 1 p) 
0.40 (Sy /- 2 4 
0.40 fil N 3 4 
0.45 He 7 l 2 
0.45 fife i 9 11 
0.45 te oe 11 17 
0.45 [H] = 1 a) 
0.45 fH] \7 1 y) 
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Table 8. Unstable portions in the case of the ending 


point 
Unstable pairs of strokes 
AofE Unstable Stroke number 
character Pattern = or 

i i 
0.10 + aT 8 | 10 
0.10 [n] al 4 5 
0.10 if = -) 3 
0.15 i=l sy 2 5 
0.20 fie =) 4 5 
0.20 ae = 18 20 
0.25 fi =I 3 4 
0.25 He a 11 12 
0.25 EX = 11 12 
0.25 fea] =) 12 3 

a) b) c) 


Fig. 6—Sample patterns having representative points 
which are near to each other. 


representative point, the positional relation 
between the representative points becomes 
unstable when the specific representative points 
are close to each other in the kanji structure. 

It is reasonable to conclude that when points 
are essentially close to each other, regardless 
of the writer of the kanji character, the relative 
positional relation between those points 
becomes unstable. 

This fact indicates the measures to be taken 
to obtain structural information that is more 
stable than that shown in Table 4. This topic 
will be discussed in the next subsection. 

2.2.5 Dynamic selection of representative 

points 

Structures such as those shown in Fig. 6 
often appear in kanji characters. 

1) Case a: 

If the midpoints or the centers of gravity are 
selected as the representative points, positional 
relations may become unstable, but they may 


become stable if the beginning points or ending 
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Table 9. Frequency distribution of the best stability using two kinds of representative points 


A max Band E BandM BandG EandM EandG MandG 
0.45 l I 
0.50 1 1 1 1 3 
0.55 5 
0.60 1 2 
0.65 5 
0.70 5 
0.75 7 
0.80 5 
0.85 9 
0.90 5 5 3 16 
0.95 11 6 6 6 13 41 
1.00 10 394 10 405 10 405 10 400 10 395 10 313 

Total 10 412 10 412 10 412 10 412 
Mean 


points are selected. 
2) Case b: 

If the beginning points are selected as 
representative points, the relative positional 
relation may become unstable, but it may 
become stable if other representative points are 
selected. 

3) Case: ¢: 

If the ending points are selected as repre- 
sentative points, the positional relation may 
become unstable, but it may become stable if 
other representative points are selected. 

Which representative points should be 
selected for the two arbitrary strokes S; and S;? 
If the strokes are as shown in case a) of Fig. 6, 
neither M nor G should be selected as the rep- 
resentative point. If the strokes are as shown in 
case b), B should not be selected. If they are as 
shown in case c), the ending points should not 
be selected. 

To see how much stability can be 
maintained by making this selection, the stabili- 
ties of ci for the two types of representative 
poins P and @Q are defined respectively as follows: 


Ap = | A(k,i,j) forP }, rusakel 
Ag = | A(k,i,j)forQ} . Lean) 
Then, the following is obtained: 

Amgx = max(Ap,Ag)s = st nas (9) 


The frequency distributions of Ajax are 
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Table 10. Frequency distribution of the best stability 
using three kinds of representative points and 
the instable cases 


Unstable cases 


@¥ (11, 17) 


Amax Frequency 
0.50 1 
0.55 
0.60 
0.65 
0.70 
0.75 
0.80 
0.85 
0.90 
0.95 2 
1.00 10 409 


a¥ (8, 9) FR (12, 13) 


shown in Table9. Table9 is equal to the 
distribution of the stability of Gh, where the 
more desirable representative points of the 
two types were specified for each set of SF and 
Sf. Table 9 shows that M and G behave similarly. 

Now, let us increase to three the types 
of representative points to be observed, and 
consider cases where the most desirable rep- 
resentative points are always selected from 
among the three types. However, since M and 
G have similar behavior, only M will be used. 
Therefore, only the types of representative 
points that give the best stabilities of ck need be 
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Fig. 7—Kanji pattern {\/.. 


selected from among the beginning points, 
midpoints or ending points. 
This can be expressed as follows: 


Amax = max(Ap,Am,Ag). ..... (10) 


This is shown in Table 10. 

From this result, if the most desirable rep- 
resentative points are selected from _ these 
three types on a case-by-case basis to check 
the stabilities, it can be verified that the mean 
stability is 0.999 9. 

In this way, it has been shown that if the 
category of the kanji character and the two 
strokes contained therein are specified and 
a dictionary is prepared specifying the type of 
the representative points and the direction to 
be observed (that is, the left-to-right or the 
top-to-bottom direction, whichever is stable) 
as the positional relation between the rep- 
resentative points, even handwritten kanji 
characters have very stable structural informa- 
tion, except in special cases. In fact, of the 
10412 sets of kanji patterns used in this paper, 
10 409 sets show a stability of 1.00, and two 
sets (the 8th and 9th strokes of #&% and the 
12th and 13th strokes of #& ) show a stability 
of 0.95. The remaining sets were the 11th and 
17th strokes of f#€, which shows that the 
position of the 17th stroke (a dot) fluctuates 
significantly, depending on the writer. 

2.2.6 Summary 

The foregoing is summarized as follows: 
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Fig. 8—Pattern divisible into left and right. 


When we have determined the kanji category 
and the two strokes S; and S; that are to be 
observed, a mean stability of 0.999 9 for the 
values of Gt can be observed by specifying the 
following: 

1) The type of representative point to be ob- 
served, and 

2) The direction of observation (top-to-bottom 
or left-to-right). 

Since there is a 99.99 percent stability in the 
values found for Gk, by observing information 
inherent to the kanji characters alone, the 
structure matrix Gk, in which the i-th and j-th 


elements are selected according to points |) and 
2) above, also contain sufficintly stable informa- 
tion. 


3. Resolution of kanji structure 
3.1 Examples 

Even a complex kanji pattern often consists 
of divisible sets of simple patterns. For example, 
the kanji character fi. shown in Fig. 7 as: a 
composite of pattern { (the left-side radical 
meaning “man”) and pattern 4 . As shown in 
Fig. 8, the kanji pattern P can be divided into 
two patterns P, and P, for simpler representa- 
tion. 

Another example is the kanji character 
46 which isa composite of pattern ( — ‘‘orass’’) 
and pattern 1t , and can be divided from top 
to bottom as shown in Fig. 9. 

Let us pursue this analysis further. { can 
be divided from top to bottom, and ‘4 can be 
divided from top to bottom to obtain “ and 

| aa 

Continuing with this process, a kanji struc- 
ture can be resolved into individual strokes. 
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Fig. 9—Pattern divisible into top and bottom. 


Fig. 10—Kanji pattern vr. writing in irregular order. 


For example, the kanji character {2 can 
be resolved into seven components. These 
strokes are numbered from | to 7 for identifica- 
tion, as shown in Fig. 7. Each of these numbers 
is called the dictionary stroke order. The stroke i 
in dictionary stroke order is $;. The order 
of actual writing of each stroke (different 
from the dictionary stroke order) is as shown in 
Fig. 10. 

Assume that only the actual writing order 
is known. Now let us consider how we can find 
whether the stroke whose actual writing order 
is x,, for example, is equal to 1 in the dictionary 
stroke order. 

It is known that the midpoints of strokes | 
and 2 in the dictionary stroke order of this 
knaji pattern are stably located to the left of the 
midpoints of strokes 3 through 7. Therefore, 
after the midpoints of each stroke of an actual 
written pattern are obtained, and the two mid- 
points on the left are selected, then the stroke 
corresponding to the two midpoints will be the 
two strokes whose stroke orders are x7 and x, 


188 


Fig. 11—Partial tree structure representation of the kanji 
character {if . 


Fig. 12—Tree structure representation of the kanji 
character fi. . 


(that is, strokes | and 2 in the dictionary stroke 
order). 

Next, if, of the two strokes, the stroke whose 
M is on top is selected, it should correspond to 
the stroke whose writing order is x, (stroke | in 
the dictionary stroke order). These processes are 
represented as shown in Fig. 11. 

L2 means that two midpoints are to be 
taken from the left side. p indicates the set of 
the selected strokes, while r indicates the sets of 
remaining strokes. For the two selected strokes, 
U1 means that one M is to be taken from the 
top. The selected stroke is S$, , and the remaining 
stroke is $4. 

For S3 through S,, similarly, by repreating 
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Fig. 13—Kanji pattern fir representating beginning 
points of strokes. 


access to the patterns from the top or bottom 
or from the left or right and by observing 
the midpoints, the dictionary stroke order can 
be learned, even if the actual writing order is 
different from the dictionary stroke order. 

An example of this method is shown in 
Fig. 12. 


3.2 Possibility of structure resolution 

Now, let us examine further whether kanji 
patterns can be resolved in practice. 

The stable positional relation of the two 
strokes ensures that at least either the top-to- 
bottom relation or the left-to-right relation 
of the representative points of the two strokes 
is stable. 

Now, let us consider strokes | and 4, shown 
in Fig. 13. Here, the black dots of the pattern 
shown in Fig. 13 represent the beginning points 
of strokes. Since the top-to-bottom relation 
between the beginning points of strokes | and 4 
is stable, it can be said that the relative position- 
al relation between strokes 1 and 4 is stable. 
However, it is obvious that the left-to-right 
relation between the beginning points of strokes 
1 and 4 is not stable. Therefore, if two beginning 
points are to be taken from the left side, strokes 
1 and 2 are not always selected. That is, strokes 
2 and 4 could possibly be selected. 

In other words, high stability of the relative 
positional relation between strokes does not 
always guarantee arbitrary resolution of the 
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Fig. 14—Vertically divisible pattern and its structure 
matrix. 


kanji pattern. 

Therefore, an experiment is necessary to 
ascertain the possibility of resolving the kanji 
pattern. For this reason, we describe below 
the algorithm for determining the possibility 
of resolution by viewing the structure matrix 
of a kanji pattern. 

In Fig. 14, patterns P, and P, are patterns 
that are arranged from left to right. Assume that 
these two patterns make up a character (calling 
its pattern P). 

As shown in Fig. 14-b), the structure matrix 
of P consists of four matrices. That is, the two 
square matrices g, and g,, which correspond to 
the structure matrices of patterns P, and P, 
respectively, and the two matrices names / and r. 

Matrix / is a matrix such that all its elements 
contain value L, while matrix r is a matrix such 
that all its elements contain value R. Matrices / 
and r indicate that P, is located to the left 
of P,. 

If it is known that the arbitrary structure 
matrix g consists of matrices / and r and two 
square matrices g, and g, as shown in Fig. 14, 
it is clear that pattern P, whose structure matrix 
is g, can be resolved into two patterns, P,; and 
P,, whose structure matrices are g, and gp, 
respectively. It is also apparent that P, is located 
to the left of P,. 

Similarly, assume uw to be a matrix, all 
elements of which contain value U, and also 
assume d to be a matrix, all elements of which 
contain value D. In this case, if it is known 
that the arbitrary structure matrix consists 
of matrix uw, matrix d, and two square matrices 
g, and g, as shown in Fig. 15, it is obvious 
that pattern P whose structure matrix is g, can 
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Fig. 15—Horizontally divisible pattern and its structure 
matrix. 


be resolved into two patterns, P,; and P,, that 
are in top-to-bottom relation. 

From the foregoing study, whether an 
arbitrary pattern can be resolved into patterns 
in the top-to-bottom relation or left-to-right 
relation depends on whether the upper right 
matrix corresponds to either matrix / or matrix 
u, after the square matrix at the upper left and 
the square matrix at the lower right of the 
structure matrix g of the pattern have been 
determined as shown in Figs. 14 or 15. (If the 
upper right is matrix /, the lower left is always 
matrix r, and if the upper right is matrix uw, 
the lower left is always d, and vice versa.) 

Whether pattern P can be resolved can be 
determined by checking the matrix at the upper 
right while gradually increasing the size of 
the square matrix at the upper left starting at a 
value of one. 

Let us see how this applies to a specific 
example, using the structure matrix (matrix g) 
shown in Fig. 16. By examining this matrix we 
can determine whether the original pattern can 
be resolved. Taking the size of g, as | at first, 
since the matrix at the upper right (matrix /) is 

h=(RU RU LD LU LU LU), 
it is apparent that it is neither the / type nor u 
type. Then, by increasing the size of g, to 2, 
the matrix at the upper right (/) becomes 


| RU LD LU LU | 
RU LD LD LU LU J 


This matrix is neither / type not wu type. Next, 
increasing the size of g, to 3, matrix h is 
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[ex RURULDLULULU| 
|LD ** RULD LD LULU 
(LD LD ** LD LD LD LD 
|RU RU RU *& LU RU LU 
RD RU RU RD ** RU LU 
|RD RD RULD LD ** LU 
|RD RD RU RD RD RD ¥ | 


Fig. 16—Kanji pattern {K and its structure matrix. 


LD LU LU LU 
h=| ED ED LU LG |, 
LD LD LD LD 


this is obviously an /-type matrix because all of 
its elements contain value L. The matrix at the 
lower left is of course r type. At this time, 
g, is as follows: 


** RU RU 
g,=| LD** RU 
LD LD ** 


The remaining square matrix g, is as follows: 


we LU RU LG 
._ | RD ** RU EY 
82>) rp Lp ** tu ott (15) 


KD! RD RD ** 


Therefore, it is obvious that structure matrix 
g consists of two square matrices, g, and gp, 
and an / type matrix and an r type matrix, 
as shown in Fig. 14. Hence, the original pattern 
corresponding to g can be resolved into patterns 
in the left-to-right relation, in which case the 
left pattern has three strokes and its structure 
matrix is equal to g,, while the right pattern 
has four strokes and its structure matrix is 
equal to g,. 
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This result was induced mechanically, based 
only on structure matrix information. However, 
this result is very similar to the result of intuitive 
human judgement concerning the kanji pattern 
shown in Fig. 16. 

The foregoing describes the algorithm 
that determines whether the original pattern 
can be resolved by viewing structure matrices 
that correspond to individual kanji patterns. 

What if one category is found to contain 
many different hand-written patterns? If there 
should be, for example, 40 different patterns, 
we will create a new, integrated structure 
matrix that can be used to represent all 40. 

First of all, taking each element in order, the 
individual elements of all 40 matrices are com- 
pared with each other. If an element in any 
particular place in the matrix —For the sake of 
explanation, let us use the Sth element in the 
series as an example.— is found to have the same 
value in all 40 matrices, then that value will be 
used for the (Sth) element when creative the 
integrated structure matrix. If for some of the 
pattern matrices, this (Sth) element is found to 
have a different value from the value it has in 
the others, the portion of its value that does not 
match will be represented in the new integrated 
structure matrix as “?”. The new integrated 
structure matrix created in this way is called 
the structure matrix of the category and rep- 
resents the structure matrices of all 40 patterns. 
The probability of resolution of the representa- 
tive patterns of the category can be determined 
by applying the above algorithm to the category 
structure matrix. 


3.3 Experimenting with structure resolution 
When a structure matrix is given, it can be 
mechanically determined whether its original 
pattern can be resolved into patterns in the 
left-to-right relation or in the top-to-bottom 
relation. This is also true for the structure 
matrix representing the category. Therefore, 
let us carry out experiments on the probability 
of resolution of many categories. If a pattern 
has a significant fluctuation and therefore any 
element of the structure matrix representing 
the category has the valve “*??”’, the category can 
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not be perfectly resolved. 

3.3.1 Overview of experiments 

Experiments were carried out on each rep- 
resentative point, B, E, M, and G for 198 
categories out of the 200 character categories 
shown in Table | (excluding two catagories that 
have only one stroke). That is, experiments 
on the probability of resolution of the category 
structure matrices were carried out in the fol- 
lowing sequence, using patterns written by 40 
writters: 

1) Simple resolution of a category structure 
matrix with only one type of representative 
point. 

2) Using two of the category structure 
matrices, each one with a different type of 
representative point. An experimental re- 
solution is carried out on one type of 
representative point, and if the pattern 
cannot be resolved any further, then resolu- 
tion is carried out on an other type of 
representative point, and so on. 

3) Resolution is carried as far as possible in 
the same way as item 2) above, with three 
types of representative points. 

Due to the similarity of M and G, we did not 
use all four types and representative points 
in carrying out experiments on the category 
structure matrices. 

3.3.2 Probability of resolution of a struc- 

ture using a single representative point 

A simple analysis was made of a category 
structure matrix (Let us simply call it a “‘struc- 
ture matrix”, when no confusion is expected.) 
with a single type of representative point. That 
is, the given structure matrix was resolved into 
partial structure matrices. Next, an attempt 
was made to further resolve each partial struc- 
ture matrix until it became indivisible. Then the 
number of these indivisible partial structure 
matrices was counted. 

Assume the number of indivisible partial 
structure matrices of d-dimension to be Z (qd). 
Since the partial structure matrices of /-dimen- 
sion cannot be resolved any further, Z (/) 
indicates the number of strokes that can be 
resolved in the given category. 
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Table 11. Value distribution of Z/d) in the case of simple 
resolution of kanji structure 


d Z(d) 
B E M 

1 947 593 1 346 

2 192 51 83 

3 36 23 37 40 
4 24 5 6 
5 17 8 7 
6 7 4 6 
1 8 9 3 
8 11 5 0 
9 2 3 1 
10 2 2 0 
11 0 1 0 
12 3 2 1 
13 2 2 2 
14 3 1 0 
15 2 3 0 
16 2 1 1 
17 0 1 0 
18 0 0 0 
19 0 0 0 
20 0 0 0 


Table 11 shows the results of simple resolu- 
tion of the characters in the 198 categories 
used in this experiment. The sum of the strokes 
contained in these characters is 2010. Assuming 
the maximum value of d, where the value of 
Z (d) #0, to be dmax, the following expression 
holds: 

d m ax 


x dxZ(d)=2010. 
aah 


As shown in Table 11, the value of dyax is 
16, if the representative point is a B; 17, if the 
representative point is a M; and 19, if the 
representative point is an E. 

Also, assuming the percentage of patterns 
| out of the total numbers of strokes (2010) | 
that can be resolved into structures consisting 
of only one stroke to be q, then 

q=Z(1)/2010. 

This is regarded as the probability of reso- 
lution. 

Table 11, shows that considering the prob- 
ability of resolution for all representative 
points, the value g obtained in the case of the 
center of gravity is the largest. Even in this case, 
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Table 12. Probability of resolution of structure matrix 


Number of representative points 
used for resolution 


One kind 


Two kinds 


Three kinds 


however, the value, as shown in Table 12, is 
approximately 0.76, indicating that the use of 
a single representative point is not sufficient to 
determine the reconfiguration of the stroke 


order. 

3.3.3 Probability of resolution of a struc- 
ture using two types of representative 
points 

Two types of representative points were 

selected, and the category structure matrix for 
one of them was resolved as far as possible. 
When it became indivisible, the matrix for the 
other was then resolved further. Lastly, only the 
structure matrices that could not be resolved 
using both representative points were counted. 

Results are shown in Table 13. Confirmation 

was also made as to which of the representative 
points should be taken first—the one which 
achieved a better result, or the one which 
achieved an inferior result in Table 11. It was 
found better to resolve the structure matrix 
using as far as possible the type of representative 
point that achieved a better result, and after 
that, using another representative point. 

However, combinations of the centers of 

gravity and ending points are considered better 
that those too — similar centers of gravity 
and midpoints. Combinations of the centers 
of gravity and beginning points do best, which 
appears to indicate that it is better to select 
representative points that can supplement each 


other. 
The probability of resolution (q) is 0.95 


in the best case (where the resolution was 
carried out first with the G, and then with 
the B), which represents a significant improve- 
ment over the use of a single type of representa- 
tive point. 
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Table 13. Value distribution of Z/d) when using two 
kinds of representative points 


d Bnd BPM ang BUS snd BIS and BiG ant BiG and Mi 
1] 1651 | 1846 | 1913 | 1706 | 1 838 | 1700 
2 34 12 12 10 14 77 
3 7 5 2 3 7 14 
4 10 1 2 0 2 1 
5 1 4 2 5 4 3 
6 5 3 4 4 3 3 
7 3 1 0 4 2 2 
8 1 0 0 4 0 0 
9 1 1 0 2. 1 1 
10 1 0 0 1 0 0 
11 1 1 0 2 0 0 
12 3 1 1 1 1 1 
13 2 1 1 1 2 2 
14 2 0 0 2 0 0 
15 2 1 0 2 0 0 
16 1 1 0 1 1 1 
sey 0 0 0 1 0 0 
18 0 0 0 0 0 0 


3.3.4 Resolution using three types of rep- 
resentative points 

Three types of category structure matrices 
were used. Resolution was first performed 
using one of the three types of representative 
points, then another type, and finally the third 
type. Table 14 shows the results of resolution. 

Six experiments were performed on combi- 
nations of the B, E, and G, and six more ex- 
periments were performed on combinations 
of the B, E, and M. The distribution of the 
sizes of the structure matrices that could not 
be resolved did not depend on the sequence 
in which the resolution was carried out. There- 
fore, only the types of combinations and their 
results are shown. 

Since the G and the M are similar, experi- 
ments on combinations of the two were omitted. 

As can be seen from Table 14, the probabili- 
ty of resolution (q) for combinations of the B, 
G, and E was calculated to be 0.987. 


3.4 Review 
3.4.1 Categories that are difficult to resolve 


Combinations of representative points have 
improved the probability of resolution of 
structure matrices, as shown in Table 12. 
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Table 14. Value distribution of Z/d) when using three 
kinds of representative points 


d Z (d) 


B, M, E she B, G,E 
1 1 947 L973 
2 1 1 
3 0 0 
4 0 1 
= 0 0 
6 0 1 
7 0 0 
8 0 0 
9 0 0 
10 0 0 
1] 2 0 
12 2 1 
13 0 1 
14 0 0 
15 1 0 
16 0 0 
17 0 0 
18 0 0 


Table 15. Categories of difficult resolution 


Category 
BE WBE de Bae THE 


Selected representative points 


B,G,E 


However, to improve the probability of 
resolution, the categories containing indivisible 
partial patterns were checked and are as shown 
in Table 15. 

Some partial patterns in these characters 
do not satisfy the conditions for pattern resolu- 
tion for any of the representative points. 

The reasons for this are divided as follows: 

1) A stability of 99.99 percent between two 
arbitrary strokes can be guaranteed by 
the appropriate selection of representative 
points for either the top-to-bottom relation 
or the left-to-right relation. But it cannot 
be guaranteed that all the pairs of strokes 
always have the same left-to-right (or top- 
to-bottom) relation. 

2) The positional relation between two 
particular strokes in both the top-to-bottom 
and left-to-right direction may sometimes 
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Fig. 17—Left-to-right relation becomes sometimes 
unstable. 


a) Left-to-right b) Top-to-bottom 


Fig. 18—Two types of relational relations between 
leaning strokes. 


be unstable. 

In the case of 1) above, for example, in 
the kanji character f shown in Fig. 17, 
its 5th and 7th strokes are mutually stable 
in the top-to-bottom direction, but are some- 
what unstable in the left-to-right direction. 
In such a case, an attempt to resolve this 
character into a left portion containing strokes 
1 through 5 and a right portion containing 
the remaining strokes will not succeed, due to 
the instability of strokes 5 and 7 in the left-to- 
right direction. Another example of 1) is the 
kanji character ff . Since strokes 1 through 8 
are all located to the left of the remaining 
strokes as viewed from the G, this kanji 
character can be resolved into a left portion 
and right portion. If the beginning points, mid- 
points and ending points are considered, how- 
ever, the left-to-right relation between strokes 1 
through 8 and some of the remaining strokes 
cannot be completely guaranteed. As a still 
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another example, the kanji character fif€ cannot 
be resolved because the relation between stroke 
7 and strokes 8-11 cannot be established for the 
B, G, E (the M of stroke 7 is always above 
strokes 8 through 11). Difficulties in resolving 
the kanji characters Jt} and ™ | are considered 
to be equivalent to those in case 1) above. 

Case 2 is an example with significant posi- 
tional fluctuation due to individual differences, 
such as stroke 17 of the kanji character MH . 
As with the relation between strokes 8 and 9 
of the kanji character “#% and the relation 
between strokes 12 and 13 of the kanji character 
PK , Where the two diagonal strokes are 
parallel to each other, as shown in Fig. 18, the 
relation can be either a left-to-right relation or 
a top-to-bottom _ relation, depending on 
individual differences, and therefore neither 
the top-to-bottom relation nor the left-to-right 
relation of these characters as a whole is stable. 

3.4.2 Introducing the diagonal approach 

The preceding subsection reviewed the 
causes difficulty in resolving patterns for some 
categories. 

Temporarily ignoring the dot or 17th stroke 
(final stroke) of the kanji character f , 
which fluctuates significantly due to the writers’ 
habits, let us consider overcoming some other 
causes of instability with simple algorithms. 
One algorithm is a countermeasure for instabili- 
ty due to the relations shown in Fig. 18. In 
this case, the positional relation between the 
representative points, if not limited to the left- 
to-right or top-to-bottom relation, looks stable 
when viewed from upper left to lower right. 

In general, if both the top-to-bottom 
relation and the left-to-right relation are 
instable, something irrational must be done to 
resolve the kanji pattern into the top-to-bottom 
relation or the left-to-right relation, unless 
it is a case of significant individual differences. 

One method of solving this problem is to 
consider the stability of the upper left — lower 
right relation and the lower left — upper right 
relation in addition to the top-to-bottom 
relation and the left-to-right relation. In fact, 
much as for the kanji characters bil and 
where the character structure consists of upper 
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Table 16. Value distribution of Z/d) with the diagonal 
approach, using three representative points 
—— SS 


d Z (d) 

B, M, E B, G, E 
1 1999 1999 
2 0 0 
10 0 0 
11 1 1 
12 0 0 


left, upper right, and lower center portions, 
the interrelation of stroke sets should not be 
limited to the top-to-bottom or left-to-right 
relations. 

Based on the foregoing study, experiments 
were carried out for cases where resolution 
conditions include diagonal relations, in addition 
to vertical or horizontal relations. 

That is, the relations between stroke re- 
presentative points Dj; (x;,¥;) and Dj (x;, yj), 
x; + yj were compared to x; + y;, and the point 
with the smaller value was assumed to be the 
lower left and the point with the larger value 
the upper right. Similarly, x; — yj and x; — y; 
were compared, and the point with the smaller 
value was assumed to be the upper left and the 
point with the larger value the lower right. 
This is called introducing of the diagonal ap- 
proach. 

If this lower left — upper right relation or the 
upper left — lower right relation is stable, it is 
used to resolve the stroke sets. 

Resolution was — performed 
possible, by using three types of representative 
points and introducing the diagonal approach. 
The results obtained are shown in Table 16. 
Using the value of g, the probability of resolu- 
tion was evaluated to be g=0.995. 

This study, indicates that the probability 
of resolution for kanji patterns is 99.5 percent 
when the probability of the diagonal relation 
is considered in addition to the probabilities of 
the top-to-bottom relation and the left-to-right 
relation. 


wherever 
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3.5 Summary 

Each element of a structure matrix has two 
values, called the top-to-bottom relation and 
the left-to-right relation for each representative 
point of a stroke. If the most desirable observa- 
tion method is used on each kanji category 
and each set of two specific strokes, observation 
can be performed with a stability of 99.99 
percent. This was described in Subsec. 2.2.6. 

In this section, experiments were performed 
for resolving kanji structures. These experiments 
were based on the assumption that kanji 
characters can be resolved in the top-to-bottom 
direction or the left-to-right direction. Accord- 
ing to those experiments, it was found that 
up to 98.7 percent of kanji characters could be 
resolved. The principle of resolution is to 
observe as many specified representative points 
of a kanji character as specified in the diction- 
ary, from either the top-to-bottom or the left-to- 
right direction, and to keep repeating such 
operations. 

The probability of complete resolution 
was 98.7 percent for the categories shown in 
Table 1. This value is rather low, compared to 
the 99.99 percent stability of the relative 
positional relation between two arbitrary 
strokes of the same categories and patterns. 

The major reason for this, as reviewed 
in Sec. 3.4, is that the resolution of a kanji 
character should not be limited to its top-to- 
bottom relation or the left-to-right relation, 
some kanji characters having shapes that can 
be resolved stably in the diagonal direction. 

The principle of resolution was therefore 
improved so that a diagonal approach from 
either the upper left or the upper right direction 
is allowed for specific kanji characters, in 
addition to top-to-bottom and _ left-to-right 
resolution, and the probability of resolution was 
thereby raised to 99.5 percent. 

The technique introduced in Sec. 3.4 for 
resolving kanji characters is thus seen to be 
satisfactory. 
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Table 17. Structure dictionary of the kanji character 


(vi. 
Se 
piamaat Approaching Type of 
Steps of directs ; ; 
irection representative point 
strokes 
1 2 Left-to-right 


Top-to-bottom M 


4. Kanji character recognition by structure 
resolution 
4.1 Overview of the recognition algorithm 

As described in Chap. 3, kanji characters 
can be resolved into partial patterns containing 
only one stroke, with a probability of 99.5 
percent, by repeating the operations pre- 
determined for each specific category. 

The operations are to observe as many 
representative points as specified from the 
specific directions. 

A list of operations for the kanji character 
f. is shown in Table 17. Since this list of 
operations is a dictionary to be used for re- 
solving the kanji character {% , it is called 
the structure dictionary for the kanji character 
fir 

Table 17 shows only the operational 
portions of the contents of Fig. 12. 

Although it is not shown in Table 17, 
the execution of step 1 causes the strokes in 
stroke order 1 and 2 to be separated, and the 
execution of step 2 for the strokes in stroke 
order | and 2 causes strokes in stroke order | to 
be separated. 

Since the kanji character has been resolved 
into partial patterns with only one stroke by 
steps | and 2, step 3 is executed for the remain- 
ing pattern to separate the stroke of stroke 
order 3. Step 4 is executed for the remaining 
pattern, and the stroke of stroke order 4 is 
separated. Then, step 5 is executed for the 
still remaining pattern, and the strokes of stroke 
orders 5 and 6 are separated, leaving only the 
stroke of stroke order 7. Step 6 is executed 
for the patterns that have just been separated 
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(strokes of stroke orders 5 and 6), and as a 
result, the stroke of stroke order 5 is separated, 
leaving the stroke of stroke order 6. 

Thus, operations from step 1 to step 6 
cause the kanji character 1. to be separated 
into parts containing only one stroke. If separa- 
tion is successful, as shown in this case, the 
stroke order can be given for all the strokes in 
the original pattern (see Sec. 3.1). 

The patterns to be resolved can generally 
be summarized as follows: 

1) The entire kanji character is to be resolved 
by step 1. 

2) If the number of representative points to be 
separated by step i (number of strokes to be 
separated) is two or more, this stroke group 
is to be separated by the next step |step 
(i+ 1)}. 

3) If the number of representative points to be 
separated by step 7 is only one, the entire 
remaining part is to be resolved by step (i + 
1). However, if that remaining part consists 
of only one stroke, step i is the end step. 
A kanji character consisting of n strokes 
can be resolved with (nm —1) steps. 

Based on the above, and assuming that a 
given kanji pattern belongs to a certain category, 
a stroke order can be allocated to each stroke 
of the kanji pattern by applying the structure 
dictionary for the category. Then, using the 
allocated stroke order, each stroke is compared 
with the standard character pattern for that 
category to calculate the differences among 
strokes, and the sum of differences is considered 
to be the tentative difference of that character. 
This tentative difference is calculated for each 
predictable category (that is, for each of the 
categories whose number of strokes matches 
the number of strokes of the input pattern, 
and the category with the minimum tentative 
difference is regarded as the recognized category 
of the input pattern. 

If the input pattern is correctly recognized, 
then its structure will be correctly resolved; 
therefore, each individual stroke of the input 
pattern matches the corresponding dictionary 
stroke, so the difference will become sufficiently 
small. 
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However, in the other cases, the structure 
of the input pattern is resolved forcibly by an 
irrelevant structure dictionary, and the in- 
dividual strokes of the input pattern are forced 
to correspond to the individual strokes of an 
irrelevant dictionary. Therefore, the difference 
is much larger than that of a correct structure 
resolution. 


4.2 Definition of difference 

Following structure resolution, various 
methods are available for defining in detail 
algorithm for matching individual strokes 
with the corresponding strokes of the dictionary 
pattern (this is called stroke level matching), 
and needless to say, more research is necessary 
to select the best method. For convenience, 
Euclidean distance is used as the basis for 
calculating the difference, as this minimizes 
the effect of small differences. 

For actual stroke level matching between 
the input pattern and the dictionary pattern, 
the input pattern must be normalized to the 
dictionary pattern. That is, the centers of gravity 
of the input pattern is laid over the dictionary 
pattern, and the input pattern is adjusted (ex- 
panded or shortened) so that the secondary 
moment matches that of the dictionary pattern”. 
Following this normalization process, the i-th 
stroke of the dictionary pattern is assumed 
to be ¢;, the corresponding input stroke to 
be s;, and the N points that approximate each 
stroke to be P; and Qj (j=1,...,N), respec- 
tively. 

Assuming the Euclidean distance between 
Pj and Qj to be PQ;, the difference d; between 
s; and ¢; is as follows: 

1 N 

d; = re (PQ;). 

Assuming the difference of the entire 
pattern to be D, then 


n 
D=2% d;, 
i=] 


where n represents the number of strokes of 
this pattern. 
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4.3 Recognition experiments 

4.3.1 Overview of experiments 

The preceding section describes a new 
recognition technique that limits pattern 
matching to stroke-level matching, using the 
structure resolution algorithm, instead of 
matching the entire input pattern with the entire 
dictionary pattern. To verify the effect of this 
technique, two experiments were carried out. 
1) Experiment 1 

A recognition experiment was carried out 
based on the flowchart shown in Fig. 19, using 
the handwritten kanji patterns for 200 catego- 
ries written by 40 writers (see Table 1). 

A recognition ratio of 99.7 percent was 
achieved. 
2) Experiment 2 

Since many of the kanji characters used 
for experiment 1) above contain a different 
number of strokes and there are fewer categories 


Count number (7) 
of input strokes 


Take a SD 


Select all the 
structure 
dictionaries 

(SDs) of n-strokes 


Save the 
categorry 


Resolve kanji 
structure 


Normalize 


é D pin? = D 
input pattern ee 


Stroke-level 
matching to 
calculate d; 


Output the 
saved _ 
categories 


Fig. 19—Recognition flowchart. 
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Code : Cc | character code | 

Kakusu) : n { No. of strokes | 

Tensu : N | No. of points | 
l Xi Yi, X12 Yi2 Xi Yin 
2 Xo, Yay Xo, Yoo Xan Yon 
n Xu Yu Xn Yne Xan Yan 


Fig. 20—Form of pattern dictionary. 


to be compared for the specific number of 
strokes, a higher recognition ratio may be easily 
achieved. Therefore, 203 categories of hand- 
written ten-stroke patterns were collected from 
40 writers. These categories are shown in 
Table 2. 

This is a worst case experiment in the sense 
that the number of these categories is the 
maximum among the numbers of categories 
for characters with the same number of strokes. 
To eliminate learning effect, the structure 
dictionary was written manually for each 
category, using human knowledge. As a result, 
a recognition ratio of 99.5 percent was achieved, 
and it was found that the increasing number 
of categories did not shift the recognition 
ratio significantly. 

4.3.2 The dictionary structure 

In the recognition experiments two diction- 
aries were prepared for each category: the struc- 
ture dictionary and the pattern dictionary. 
The concept of the structure dictionary is as 
shown in Table 17, and the pattern dictionary 
is as shown in Fig. 20. The coordinate for the 
G of the pattern is assumed to be (0,0), and 
the frame of the pattern is assumed to be 
a square, one side of which is 100 units long. 
For example, the coordinate of the upper 
right corner of the frame is (+50,+50). In- 
dividual strokes within the frame are assumed 
to approximate n points of equal intervals, 
and their coordinates are recorded in the pattern 
dictionary. The numbers of the strokes are 
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Fig. 21—Sample pattern of a kanji. 


Code > 304C 
Kakust ie 
Tensu 5 
1 > -18 42, -23 30, -29 19, -36 7, -45 -2, 
2 -29 13, -26 1, -26 -10, -26 -22, -27 -37, 
3 +d 4% 5 45 Asp a) oasis 38: 
4 -14 27, -1 28, 10 28,22) 30;36 31, 


5 =] 18,0 6,0 =5,0 17,0. =33; 
6 : 24 22, 22 10, 20 -1, 19 -13,17 -28, 
7 =18 -39,-2 -34,12 -33, 27 33,45 -36, 


Fig. 22—Pattern dictionary form of a kanji pattern 
shown in Fig. 21. 


matched with the order in which they are 
extracted in structure resolution using the struc- 
ture dictionary. (The structure dictionary 
cannot be guaranteed to match the natural 
stroke order, but it is considered likely to do so 
in most cases.) 

The actual size of the dictionary is 13 Kbytes 
for a structure dictionary having 203 categories 
containing ten strokes, and 26 Kbytes for 
a pattern dictionary containing 5-point approxi- 
mation. An example of a pattern and the 
contents of the pattern dictionary are shown 
in Figs. 21 and 22, respectively. 

4.3.3, Recognition program 

The program for the recognition experi- 
ment was complied using the C language operat- 
ing under the operating system (UNICUS) 
of a PANAFACOM U-1500. The program size 
is approximately 15 Kbytes. The recognition 
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Table 18. Difference of recognition ratio with/without 
information of writing order 


Writing order 


Known Unknown 


99.78% 


Table 19. Relations between recognition and structure 
resolution 


Number of cases 


Resolution impossible and 
recognition impossible 


Resolution successful and 
recognition impossible 


Resolution impossible and 


recognition successful 
8 085 


Resolution successful and 
recognition successful 


Sum 


speed per character is approximately proportion- 
ate to the number of categories with which 
the input patterns are to be compared. It was 
approximately four seconds for experiment 2), 
which was the worst case. By comparison, 
the recognition time in cases where the stroke 
order was known was approximately two 
seconds. The difference between the two times 
is considered to be the time required to link the 
input stroke to the dictionary stroke by struc- 
ture resolution. 

4.3.4 Results of experiments 

The recognition process is divided into two 
stages: structure resolution and stroke level 
matching. A simple analysis was performed to 
determine how accurately the input strokes 
corresponded to the dictionary strokes through 
structure resolution, and the influence of this 


accuracy on the recognition ratio. 
The patterns of the 203 categories of ten- 


stroke kanji characters were used for this analy- 
sis. Each stroke of these 8120 input patterns 
(203 x 40) was manually given the correct 
stroke order. Then, stroke-level matching of 
these patterns was performed using dictionary 
patterns. Since the strokes corresponded 
completely in this case, the difference in rec- 
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ognition ratio is thought to have occurred 
during the structure resolution stage. 

The difference between the recognition ratio 
in this case and the case where stroke order is 
known is 0Q.1-0.2 percent (see Table 18). To 
determine whether this difference occurred 
during the structure resolution stage, the rec- 
ognition results were checked in relation to 
the structure resolution (see Table 19). It 
was expected that only correct structure resolu- 
tion caused correct recognition, but the results 
included a few cases of correct recognition even 
when the structure resolution was not correct. 
This may be caused by the redundancy of kanji 
characters. However, a case where the writing 
order is known must have the same result 
as in a case of correct structure resolution. 
So it still can be said that the difference in the 
recognition ratio between cases in which the 
writing order is known and in which it is un- 
known is due to a failure in structure resolution. 

The results of experiments 1) and 2) showed 
recognition ratios of 99.7 percent and 99.5 
percent. 

This algorithm features the use of structure 
resolution to make the individual strokes of 
the input pattern correspond to those of the 
dictionary pattern, instead of making a total 
comparison. While total comparison requires 
a processing time that is proportionate to the 
square of the number of strokes, structure 
resolution requires a processing time that is 
proportionate to the number of strokes; and, 
therefore, the relative efficiency of structure 
resolution increases with the number of strokes. 
Even taking into consideration that the total 
comparison method has a simple algorithm, 
the structure resolution in the case of ten-stroke 
kanji patterns requires a fraction of the time 
required for the total comparison method. 

The stroke order used by any writer was 


ignored entirely, and the correct stroke order 
was reconfigured through structure resolution, 
so the dictionary need not contain counter- 
measures against incorrect stroke order. It 
was verified that patterns with unknown stroke 
order could be recognized with only one struc- 
ture dictionary and one pattern dictionary. 
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Comparing the recognition method intro- 
duced in this paper to the method where the 
character has a fixed number of strokes and 
the order of all strokes is known, this method of 
processing needs double the time and the 
dictionary (byte) must be 1.5 times larger. 
That is, making correspondences between 
input pattern strokes and dictionary pattern 
strokes was accomplished in the same time 
as stroke level matching, using a dictionary 
which was 1.5 times larger. 


4.4 Further considerations 

As described above, 416480 (10412 x 40) 
pairs of arbitrary strokes were checked for their 
relative positional relations, using the patterns 
of 198 categories of kanji characters hand- 
written by 40 writers. 

However, since it was difficult to directly 
define the relative positional relation between 
two strokes, points were selected to represent 
the strokes and then the positional relations 
between those representative points were 
checked. 

As a result, it was found that among the 
pairs of two arbitrary strokes of handwritten 
kanji characters (a total of 10412 pairs for 
the 198 categories used for the experiments), 
10409 pairs had the same relative positional 
relations between the representative points, 
under a given condition, for all of the 40 differ- 
ent writers (see Table 11). The given condition 
was that the type of representative point (B, M, 
E, or G) to be observed and the direction 
(top-to-bottom or left-to-right) in which the 
observation was to be made were specifically 
determined for each pair of specific strokes 
in each specific category. 

This condition does not include any condi- 
tions placed on the writers. Therefore, the 
results shown in Table 11 indicate that it is 
possible to observe stable information concern- 
ing kanji structure. 

It is not known whether the same stability 
exists when changes are allowed in the number 
of strokes. 

In the view of this writer, the sense that 
a certain partial pattern is to the left of (or 
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above) another partial pattern is not limited 
to individual writers. This sense does not change 
even when the number of strokes changes. 

Therefore, the resolution method may be 
stable even when the number of strokes changes, 
but this remains to be confirmed in the future. 

Although the series of operations for kanji 
structure resolution (the structure dictionary) 
is also stable information for a kanji character, 
its stability is lower than that of the information 
concerning relative positional relations between 
strokes. However, as discussed earlier, it is clear 
that the relative positional relations between 
a certain partial pattern and another partial 
pattern will be quite stable if the direction of 
observation (top-to-bottom, left-to-right, or 
diagonal) is determined for each kanji category. 
For example, it is not possible for the kanji 
character {if (position) to be written as wf 
However, the relative position in the top-to- 
bottom direction of the 14 portion and the 
¥%. portion may be shifted, and their left 
and right positions may also get closer or a 
little farther apart, but inversion of the left 
portion and the right portion is impossible. 

The structure dictionary is an implementa- 
tion of these stable relations (see Fig. 12 and 
Table 17). 

Possible future research includes: 

1) Matching the structure matrix itself for 
recognition instead of stroke level matching. 

2) This paper describes on-line handwritten 
characters, because strokes can be extracted 
easily, or rather the representative points 
of strokes can be extracted easily and, 
therefore, experiments can be carried out 
easily. 

It will be proved, however, that the basic 
techniques described in this paper can be applied 
in not only on-line but also off-line processing. 
3) This paper describes how a stroke of a kanji 

character is represented by one representa- 

tive point, and this is possible because most 
kanji strokes are straight lines. For cursive 
characters such as those for handwritten 
alphanumeric characters and__hiragana 
characters (Japanese cursive kana characters), 
a theory similar to the one described in 
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this paper may be developed using multiple 

representative points instead of one rep- 

resentative point. 

4) Although this paper deals with kanji 
characters, more generalized applications 
may draw upon this research to consider 
the usefulness of replacing a certain pixel 
with a simpler pixel, or of replacing the 
relative positional relation of the original 
pixel with that of a simpler pixel. 

This paper has described two types of stable 
information for kanji characters, the structure 
matrix and = structure dictionary, and also 
described their interrelation. By using the 
structure dictionary as information for extract- 
ing stroke order for kanji character recognition 
through stroke-level matching, a recognition 
of 99.5 percent was achieved for kanji characters 
containing ten strokes. 

The recognition method that uses a struc- 
ture dictionary is called the structure resolution 
method. This method is significant when the 
kanji to be resolved consists of more than two 
strokes. If it contains only one stroke, this 
method is nothing more than stroke-level 
matching. 


5. Conclusion 

Two types of stable information for kanji 
characters have been described. One is the 
stability of the relative positional relation 
between kanji strokes, which can be summarized 
in the form of a structure matrix. 

The other is the structure dictionary, which 
can assign unique numbers to the strokes of 
a kanji character of n strokes in (nm — 1) steps. 
In other words, this information is capable of 
giving the stroke order. 

The structure matrix cannot be evaluated 
unless the stroke order is determined. Therefore, 
the structure matrix of the input pattern is 
evaluated after the stroke order of each stroke 
has been determined by applying the structure 
dictionary to that input pattern. 

The structure dictionary can be produced 
using human knowledge, that is, the knowledge 
that one partial pattern is to the left of another 
partial pattern, etc., and it can also be produced 
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automatically from the structure matrix by 
the algorithm described in section 3.2. In the 
latter case, the structure matrix must be defined 
first, so in this case, the correct stroke orders 
of the experimental patterns to be collected 
must all be known. 
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This paper describes a model for a cooperative expert system which simulates human group 


behavior. When social phenomena are observed, Fujitsu has realized that experts usually 


act autonomously. But there are also many cases when experts must work in close coopera- 


tion. The major design goal of this study is to construct a framework to naturally represent 
cooperation among expert systems. Since expert systems interact only via conversation, an 


expert system cannot access or change another expert's knowledge base without the owner's 
permission. This system can also process non-deterministic tasks by parallel inference. 


1. Introduction 

There are few tasks that can be done by a 
single person. When we confront a problem that 
is not in our area of expertise, we ask a specialist 
and follow his advice on how to solve the 
problem. To solve the problem using computers, 
a study aimed at human group behavior instead 
of individual behavior is required. A system that 
simulates the world of many experts is useful for 
this study. 

Our cooperative expert systems can re- 
present such a world and can simulate its be- 
havior naturally. This paper describes our 
attempt to design a distributed system based 
on cooperation among many expert systems, 
and our experimental system which simulates a 
historic battle fought by warlords of ancient 
Japan. 


2. Advantages of distributed Al 

Distributed systems have the advantages of 
speed, reliability, and expandability. These 
advantages of distributed artificial intelligence 
systems are described below. 


2.1 Parallel processing 
Parallel processing makes the system faster 
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and expands the system’s capacity. For example, 
many parallel searches of a search tree or parallel 
execution of rules is enabled. 


2.2 Tools for distributed systems 

When the system involves objects that are 
far apart from each other, an effective measure 
against speed reductions caused by the data 
traffic is to process the lower level tasks locally 
and process higher level tasks centrally. This 
idea has a wide application. Sensor network 
systems, air traffic control systems, and dis- 
tributed database systems are potential applica- 
tions. 


2.3 Individual expert systems 

The quality of inference can be improved 
if many agents process the same task from 
different points of view. In addition, the ar- 
rangement of expert systems at different sites 
can raise the independence and privacy of 
individual knowledge. 

The independence of individual knowledge 
is discussed first. The best representation of 
knowledge depends on the field of the task. 
Many models support multiple paradigms in a 
single expert system, but we believe it is simpler 


203 


N. Watanabe, and T. Kimoto: Cooperative Expert System - - - 


Personal knowledge 


Frames 


Access 


Conversation 


a) Stand alone expert system 


Read/update 


ene 


b) Distributed expert systems 


Fig. 1—Access to personal knowledge. 


and more powerful if a single system has multi- 
ple expert systems based on different paradigms. 

Second, it is important to consider security. 
In some professional fields, the expert is pro- 
hibited from disseminating his knowledge. The 
expert may also want to keep his knowledge 
a secret because of its value. In such cases, the 
knowledge of many people cannot be put 
together in a single expert system. Instead, it is 
better that each person manages his own knowl- 
edge. Such a complex intelligent system must 
also support a function whereby each agent 
can give others only the result and a brief 
explanation instead of the knowledge itself. 

Consider the expert systems consisting of 
rule-based knowledge and frame-based knowl- 
edge (see Fig. 1). In a stand alone expert system, 
boundaries of individual knowledge become 
vague and it is difficult to determine how 
strongly a change in the knowledge will influ- 
ence another person’s action. Further, experts 
might be worried about unauthorized persons 
reading or changing his knowledge base. In 
distributed expert systems, an agent cannot 
access another’s knowledge base without a 
conversation with the owner agent of the knowl- 
edge. This is a useful function to eliminate 
users’ concerns about security. 
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3. Cooperative expert systems 

Our model of cooperative expert systems 
is mainly aimed at the third advantage. An 
agent — processing object — represents one 
person. If the agents incorporate their knowl- 
edge, it increases cooperation and competition. 

If a stand alone expert system simulates one 
person’s thought process, then cooperative 
expert systems simulate a human group’s be- 
havior. Each agent plays a different role in the 
organization, and they work on the task with 
frequent interaction. It is interesting to apply 
this model to simulate animal behavior or in a 
more complex example, to simulate a human 
group’s behavior under fire. 


3.1 Requirements for individual agents 

The many agents act concurrently and 
change the common environment simultaneous- 
ly. Therefore, each agent is required to collect 
the latest information, and to plan and act 
dynamically because he cannot analyze the task 
and plan the solution in advance. 

Each agent must have the functions de- 
scribed below. 
1) Updating his world model simultaneously. 
2) Collecting information from other agents. 
3) Replanning based on new situation. 
4) Replying to other agents’ requests. 
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5) Requesting other agents to act for own 
benefit. 


3.2 Problems 

To clearly understand the cooperation, an 
experimental system was configured using 18 
personal computers (Fujitsu) FM16§$)_ that 
were connected by a network. One agent was 
assigned to each computer. This configuration 
makes it easy to understand that each agent 
can act independent of the others. We represent 
each knowledge base using rule-base and frame- 
base paradigms. The solutions to the above 
requirements are given below. 

3.2.1 Simultaneous updating of each world 

model 

The world model is common data that every 
agent can access. 

It is constantly changing which causes a 
problem for all distributed database systems. 
The way in which each site manages the com- 
mon data increases the amount of traffic to 
access. However, the way in which each site 
obtains a copy of the common data makes it 
difficult to make their copies exactly the same. 

This inconsistency problem is the most 
serious problem for distributed database sys- 
tems; but if our goal is to simulate human group 
behavior, this inconsistency instead becomes a 
rather interesting feature. Humans tend to act 
based on what they see rather than on the real 
world itself. A person acts, and if he doubts the 
environment, he senses the world again and 
replans. 

Under these circumstances, we decided that 
each agent should have his own copy of the 
world model which is updated when the manage- 
ment computer sends update packets. Each 
agent draws an inference based on an old copy 
of the world. 

3.2.2 Information collection from other 

agents 

Individual agents have their own knowledge. 
That is why agents cannot use another agent’s 
knowledge without the owner’s permission. 
To obtain another agent’s knowledge, they 
must interrupt the destination agent who might 
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be busy inferring. This approach makes it 
possible for the owner of the knowledge to 
decide whether to show or to hide his knowl- 
edge. This is one kind of request, but it is 
important to separately consider the individual 
management of the knowledge base. 

3.2.3 Replanning when a _ new situation 

arises 

We often find that a prior condition for an 
agent becomes inadequate when another agent 
updates the world model. In such cases, what 
does a human do? He acts based on what he 
sees. If he notices that the result is different 
from what he expects, he determines that some 
assumptions are inappropriate and checks the 
environment to see what has changed. It is 
difficult for a human to continue inference using 
variables that represent possible changes on 
some future day. This system, a simulator of 
a human group, has chosen the human way of 
inference. In this approach, the system assumes 
that a prior condition may not change until it 
processes one step of inference. Then it checks 
the world model to see whether the condition 
still holds. It is quite possible that this kind of 
inference leads to wrong conclusions, but it is 
more similar to human inference than “strict” 
inference. 

While our system has this benefit, we must 
use the knowledge source considering that an 
action seldom leads to the expected result. 

3.2.4 Answering other agents’ requests 

To enable timly answers to other agents’ 
requests, a special knowledge source called 
“answering knowledge source” was added to 
each agent. This can be regarded as a kind of 
interrupt routine. As with ordinary interrupt 
routines, careful coding is needed because this 
special knowledge source is called during infer- 
ence. Our system cannot guarantee consistency 
after calling the knowledge source. 

3.2.5 Asking other agents for help 

It is difficult to decide the protocol of 
conversation among agents. Different applica- 
tions require different terminology, so it is 
possible that the best protocol depends on 
each application. Since we cannot decide the 
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FM --7 FM | (Battle field simulator) 


Tp 
\ 2 


of 
(Agent simulator) 


Fig. 2—Simulation game system using personal 
computers. 


FM 


best universal protocol, our experimental 
system used a variety of protocols: from key- 
words that the receiver knows in advance, to 
S-expressions that the receiver has to evaluate. 
Although we avoided this problem in order to 
construct the entire system quickly, we are 
very interested in this problem of conversation 


among machines. 


4. Application of a simulation game system 

An experimental simulation game system 
was developed as an application of cooperative 
expert systems (see Fig. 2). The cooperative 
system can solve one problem being worked 
on by many agents. It has many agent com- 
puters that infer using their expert knowledge, 
and has a battlefield computer that simulates 
the battlefield and controls the network traffic. 

The simulation game is a board game in 
which the many pieces represent army corps 
simulating a past battle. Unlike other games 
such as chess, more than one piece can be 
moved at a time which is a good example of 
cooperative inference. An ancient Japanese 
battle called the Battle at Sekigahara was simu- 
lated to test not only the historical accuracy 
but also a variety of hypotheses. 
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Fig. 3—Configuration of cooperative expert system. 


4.1 System configuration 

4.1.1 Hardware configuration 

Figure 3. shows the configuration of. the 
experimental system. Twenty personal com- 
puters (Fujitsu FM-168) were connected using 
Omninet which is the standard network bus 
for personal computers. 

Personal computers were used for one of 
the following three purposes: 
1) Battlefield computer 

Only one of the twenty computers acts 
as the battlefield computer. It simulates the 
world model (battlefield), broadcasts the simula- 
tion result, judges the battle, and displays the 
battlefield on a 100 inch projector. 
2) Agent computers 

Eighteen computers play as pieces (agents) 
of the game. Each agent corresponds to one 
commander and his men. 
3) Console computer 

One computer is for the human operator to 
start, break, restart, and end the simulation. 

4.1.2 Software configuration 

Figure 4 shows the software configuration. 
The operating system of each computer is 
MS-DOS Version 3.1. The network controller 
module is CP-MGR Version 1.0 (includes pseu- 
do-multi-tasking and computer communication). 
All modules except the network controller have 
been developed using GCLISP. 

Major module sizes are as follows: 


1) Master controller: 3.0 Ksteps 
2) Agent monitor: 3.0 Ksteps 
3) Master simulator: 1.3 Ksteps 
4) Display module: 1.9 Ksteps 
5) Inference engine: 1.0 Ksteps 
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Fig. 4—Software configuration. 
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6) Knowledge source: 4.0 Ksteps 

4.1.3 Network control 

Figure 5 shows how agents communicate 
with each other via Omninet. Each computer 
has two background processes: a send process 
and a receive process. The main process com- 
municates with these processes via RAM disk 
(file-like formatted main memory). Figure 6 
shows the send processing sequence and Fig. 7 
shows the receive sequence. Figure 8 shows the 
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Fig. 6—Send sequence. 
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Fig. 7—Receive sequence. 


data format for RAM disk. The module was 
divided into three processes because GCLISP 
does not have the function to call CP/MGR 
directly. 


4.2 Rules of the game 

This simulation game is similar to chess 
except that many pieces can move at the same 
time. 
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1) Send data 


CC : Condition code 
nie d ee. CR : Delimiter 
2) Send condition (carriage return code) 
DA : Destination computer 
address 
DATA: Character data 
MB : Mail box number 
3) Receive request SA : Source computer 


address 
MB TIME SW : Sense/wait flag 


TIME : Wait time (x 100 ms) 


4) Receive data 


Fig. 8—Send/receive frame format (in RAM disk). 


4.2.1 Processing sequence of the game 

Simulation is divided into turns. 

One turn has the following three phases: 

1) Moving phase 

All agents move at the same time. Normally 
in a board simulation game, the two teams move 
at different times. This rule was changed to 
make the game more like a real battle. 

2) Cannon phase 

All agents having cannon can fire their 
cannon once in this phase. 
3) Fight phase 

All agents can attack other agents. 

Figure 9 shows the sequence of the master 
control module in the battlefield computer. In 
each phase, all agent computers send action 
packets to the battlefield computer. Because 
the battlefield computer processes those packets 
one by one, simulation becomes non-deter- 
ministic. For example, suppose one agent sends 
a move packet to a place that was empty last 
turn. If another agent has moved there first, 
then the first agent would receive a result packet 
that says “could not move there’. Due to this 
non-determinism, the simulation will be differ- 
ent from time to time. 

4.2.2 Battlefield 
1) Map 

The battlefield was represented as a map 
consisting of many unit areas. Each unit area 
has an attribute such as a plain, hill, river, 
woods, or road. Some parameters are dependent 
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Fig. 9—Sequence of master controller. 


on this attribute: moving rate, attack rate, and 
view rate. 
2) Pieces 

Each piece has one variable attribute and 
five static attributes. Life is a variable attribute. 
When an agent is attacked by others his life 
decreases. If his life becomes zero, he dies and 
is removed from the battlefield. 

Other attributes are static. Moving power is 
an agent’s ability to move on a plain field in 
one turn. Moving ability on other fields is de- 
rived from the product of the agent’s moving 
power and the area’s moving rate. 

Defense power is the agent’s ability to 
decrease the enemy’s attacking power. Real 
damage is determined by the product of the 
agent’s own defense power and the enemy’s 
attacking power. 

Attacking power is the agent’s ability to 
take an enemy’s life. 

Attack range is the distance within which 
an agent can attack the enemy. 

View range is the distance within which an 
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agent can see other agents. 
3) Move 

An agent can move to any empty area if 
the distance is within his real moving ability. 
Moving ability is the product of the agent’s 
moving power attribute and the moving rate of 
the area where the agent is located. Even if 
there is another agent on the way to the same 
destination, he can move unhindered. 
4) Attack 

An agent can attack any enemy within his 
real attack range. The damage to the enemy 
is the product of his attack ability, the enemy’s 
defense power, and some random number. If 
the enemy does not die in the first attack, the 
enemy can make a counterattack soon after. 
5) End of battle 

Each player determines whether or not the 
battle is concluded according to the circum- 
stances. 


4.3 World model simulator 

The world model simulator simulates the 
battlefield and move/attack action of the agents. 
The master simulator in the battlefield computer 
controls the entire sequence of move/attack 
actions of all agents and does not get involved 
in the conflict. The slave simulator in each 
agent computer replies to the information 
collection requests from agents to reduce 
the amount of data transferred. The slave 
simulator updates the world model based on 
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the update packet from the master simulator. 
Figure 10 shows the display output of the 
world model simulator. 


4.4 Outline of experimental system 

4.4.1 Agent monitors and knowledge 

sources 

The agent monitor calls knowledge sources 
to move, fire, and fight; sends the action packet 
to the master controller in the _ battlefield 
computer, and receives the result packet from 
the master controller. The agent monitor also 
has inter-agent communication functions to 
cooperate with other agents. We developed two 
types of communication functions: an orderly 
(one to one communication), and signal fires 
(one to all communication). Each agent has 
at least four knowledge sources: moving, fight- 
ing, responding to signal fires, and responding 
to orders. These knowledge sources are pre- 
viously registered with the agent monitor, 
and the agent monitor calls them as required. 
Knowledge sources for responding to signal 
fires and orders are called as soon as the cor- 
responding packets arrive. Figure 11 shows the 
display output of the agent monitor. 

4.4.2 Functions of agent monitors 

The agent monitor calls three knowledge 
sources in one turn in the following sequence: 
moving, firing, and fighting. It waits for the 
result packet after each call to update the 
slave world model (see Fig. 12). 


Fig. 10—Display output of world model simulator. 
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Fig. 11—Display output of agent monitor. 
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Move KS 


Call move KS (Inference) ——» Send move packet 


Return 
Receive move result 
Update slave simulator 


Cannon KS 


Call cannon KS (Inference) ——*Send cannon packet 


Return 
Receive cannon result 
Update slave simulator 

Fight KS 


Call fight KS (Inference) ——» Send fight packet 


Return 
Receive fight result 


Update slave simulator 


Fig. 12—Sequence of agent monitor. 


4.4.3 Simulation 

The simulation was tested based on an old 
Japanese battle in 1600 called the Battle at 
Sekigahara. The rule-based bits of knowledge 
were created based on the character, army 
forces, financial resources, and personal ex- 
pertise of eighteen important commanders. 
The system can perform a simulation based not 
only on historical facts, but also based on 
various hypotheses. The results based on the 
hypotheses are described below. The system 
performs the simulation non-deterministically 
so the result is not always the same. 

Because our agents do not have the ability 
to learn, they can deal with only the events 
that are known in advance. The simulation 
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shows that even these poor agents can cooperate 
with each other with some very interesting 
results. 
1) Making a commander’s will stronger 
First, a commander’s will was changed to 
make it stronger. In real history, commander 
Kobayakawa betrayed his side due to a threat. 
In this simulation he did not betray and many 
commanders observing his attitude took his 
side and they won. 
2) Decreasing the threshold of judgement in 
his men 
Second, Ishida’s threshold of having confi- 
dence in his men was decreased. In real history 
he led one side and lost. In this simulation he 
won to his side many commanders and subse- 
quently won. 
3) Replacing one commander with his father 
Lastly, Nagamasa Kuroda was replaced by 
his father Josui Kuroda who was a famous and 
resourceful general. When the battle came to a 
standstill, he appealed to the commanders 
observing the battle situation and organized a 
third group. 


5. Conclusion 

A model for a cooperative expert system 
was proposed to simulate human group behavior 
and an experimental simulation game system 
was developed as an application. This model 
is useful for such an application, but there are 
problems which require further study: learning, 
protocols of conversation, and world model 
conflicts being the most interesting problems. 
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This paper describes a 144 kbit/s digital subscriber loop (DSL) transmission system based on 
hybrid-mode transmission with an echo cancelling method. It uses advanced LSI technology 
to obtain compactness, low cost, and high reliability. An echo canceller (EC) LSI has been 
developed using CMOS technology. Combined with the multiplexing processor (MXP) LSI, 
the EC LSI provides basic DSL equipment functions. The system consists of a specially 
arranged frame format with a newly developed DPLL circuit for stable timing extraction, 
and automatic balancing nextwork, and a two-stage echo canceller. 

Using this line termination circuit, the DSL equipment showed a reach of over 6 km when 
used with 0.5 mm diameter cable for 160 kbit/s bidirectional digital transmission. 


1. Introduction 

The world is entering the ISDN era, where 
a variety of services will be offered through 
a unified user/network interface. This can be 
done by establishing an end-to-end digital link. 
Therefore, for the penetration of the ISDN, the 
economical realization of bidirectional digital 
transmission on existing copper pair cable is 
mandatory. Extensive studies are under way in 
this field, and some results have been publish- 
ed)! Two promising transmission  tech- 
nologies are emerging, one being time compres- 
sion burst-mode transmission (the ping-pong 
method) and the other being hybrid-mode 
transmission using an echo canceller. 

For hybrid-mode transmission, a unique 
method which relieves the system from the 
problem of timing recovery is proposed!!). 
Based on this method, a digital subscriber loop 
(DSL) transmission system was developed. 
A two-chip method, shown in Fig. 1, was 
selected for making the DSL equipment. One 
chip is for circuit termination, with the main 


function of forming a transmission format 
including frame synchronization. Since the 
processing is done by software in the multiplex- 
ing processor LSI (MXP), this chip provides 
system engineers with maximum flexibility in 
designing a transmission frame format!”). The 
other chip is the line termination LSI (EC LSI). 
Its main functions are echo cancellation, line 
equalization, and timing extraction. 

The EC LSI, now complete, uses the CMOS 
process!?), The LSI is described in some detail 
in this paper. It exhibited excellent  per- 
formance, such as error-free transmission over 
more than 6 km of 0.5 mm diameter cable, and 
a quick activation time of less than 220 ms. 

Chapter 2 of this paper briefly compares the 
burst-mode and _ hybrid-more — transmission, 
Chap. 3 describes the design concepts, Chap. 4 
describes the transmission frame format used in 
the authors’ approach, Chap.5 describes the 
circuit configuration and the _ activation 
procedure, and Chap. 6 discusses the actual LSI 
and its performance. 


The contents of this manuscript was published in part by the IEEE Journal on Selected Areas in Communications 


in April, 1988. 


212 


FUJITSU Sci. Tech. J., 24,3, pp. 212-224 (September 1988) 


T. Tsuda et al.: ISDN Subscriber Loop Transmission LSI Using Echo Cancelling Method 


Customer premises oa Subscriber a Oi —_——— 
loop 


S/T U 


To 
switch 


2 wire line 


EC LSI: Echo canceller LSI NT: Network terminator 
MXP: MUX processor LSI ET: Exchange terminator 


Fig. 1—Digital subscriber loop transmission. 


2. Comparison of the burst-mode and hybrid- 
mode transmission 
2.1 Transmission technique 

Figure 2 shows a schematic diagram of the 
burst-mode and the hybrid-mode transmission 
systems. 

In the burst-mode transmission of Fig. 2-a), 
data to be transmitted is time-compressed into 
packets. Bidirectional digital transmission is 
achieved by alternately transmitting the packets 
of burst signal in each direction between the 
exchange terminator (ET) and the network 
terminator (NT). Therefore, on the subscriber 
loop, the received signal is free from the inter- 
ference of the transmitted signal, while the line 
bit rate of the packets of burst signal exceeds 
twice the information data rate. The burst-mode 
transmission, which is based on a_ simple 
principle, is comparatively easy to apply to the 
equipment. Moreover, since the transmitted and 
received signals are separated ona time axis, the 
near end crosstalk is avoided using the burst- 
synchronization technique, where phases of the 
burst signals of all the subscriber loops are 
synchronized and transmitted simultaneously 
from the ET®?. 

Hybrid-mode transmission, using the echo 
cancelling method shown in Fig. 2-b), enables 
simultaneous bidirectional digital transmission, 
or a full duplex digital transmission. Separation 
of the transmitted and received signals is 
performed using a conventional hybrid 
transformer. The problem of this technique is 
the large echo leakage. The echo peak amplitude 
of the transmitted pulses suppressed by the 


FUJITSU Sci. Tech. J., 24, 3, (September 1988) 


akbit/s 


“+ akbit/s 


akbit/s 


b) Hybrid-mode transmission 


Fig. 2—Transmission techniques. 


hybrid transformer has a 10-dB loss compared 
with the transmitted signal amplitude, while the 
minimum received signal amplitude has a 50-dB 
loss due to the \/f loss characteristics of the 
subscriber loop. The key to realizing hybrid- 
mode transmission is the construction of a high- 
performance echo canceller which achieves echo 
cancellation of over 60 dB. Despite this difficul- 
ty, the hybrid-mode transmission is considered 
a promising method, because the line bit rate is 
almost the same as the information bit rate, 
which is about half compared with that of the 
burst-mode transmission. The reduction of the 
line bit rate is a great advantage for the digital 
subscriber loop transmission because the sub- 
scriber loop loss increases in proportion to the 
square-root of frequency, or the transmission 
line bit rate. This means that the hybrid-mode 
transmission is expected to obtain a longer reach 
of digital subscriber loop transmission. 


2.2 Circuit structure 

Figure 3 shows block diagrams of the DSL 
equipment for burst-mode and hybrid-mode 
transmission. The MXP LSI, which performs 
the circuit terminating functions, is used for 
both methods. The circuit structure of the 
hybrid-mode transmitter (U/B, DRIV) is the 
same as that of the burst-mode transmission of 
Fig. 3-a), as shown in Fig. 3-b). An adaptive 
echo canceller (EC) and a. balancing network 
circuit (BN) are added to the hybrid-mode 
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Fig. 3—Circuit structure. 


receiver for the echo cancellation. Apart from 
the echo canceller and balancing network 
circuit, the timing extractor (TIM) and the line 
equalizers (EQL) which contain the Vf AGC 
equalizer and the decision feedback equalizer are 
almost the same as those of the burst-mode 
transmission system. Consequently, the hybrid- 
mode transmission system requires a com- 
plicated circuit structure and large circuit scale 
for the echo canceller and the balancing network 
circuit. 

Taking account of the above-mentioned 
characteristics of both methods, an LSI for 
burst-mode transmission was developed in the 
first stage, and proved to have good perform- 
ance!*), An LSI for hybrid-mode transmission 
was developed in the second stage. The tech- 
nologies employed in the development of the 
burst-mode transmission LSI were applied to 
this line terminating LSI using the echo can- 
celling methos, which is described in this paper 
in some detail. 


3. Design considerations and solutions 

The following points were considered in the 
design of the transmission system and circuit 
structure. 
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Channel capacity 


Line code 


Table 1. Main specifications for the echo canceller LSI 


Line bit rate 160 kbit/s, 96 kbit/s 


2B + D (160 kbit/s) 
B+D( 96 kbit/s) 
Bipolar (AMI) 


Vf equalizer 


dvutade ranks 0-50 dB at Nyquist frequency 


Decision feedback 


i 
equalizer 2 taps (1 T, 2 T) 


Echo canceller Balancing network + 2-stage 


echo canceller > 70 dB 


Peak jitter width l< 5% 


Manufacturing process | Si-gate CMOS 


1) To avoid the use of high-precision com- 
ponents 

2) To reduce the number of gates 

3) To use as simple and stable an algorithm as 
possible 

4) Suitable partitioning of functions to analog 
and digital circuits 

5) Reasonable distribution of the processing 
load to each functional block 
As a result of the study, the transmission 

system and EC LSI developed by the authors 

have the following characteristic functions: 

1) A specially formatted transmission frame 
structure with a newly developed DPLL 
circuit is adopted for simple and _ stable 
timing recovery. 

2) An automatically selected balancing network 
is used to reduce the load of the echo 
canceller by suppressing the echo leakage. 

3) The echo canceller has a two-stage structure 
to reduce the echo replica precision. 

4) The line equalizing function is partitioned 
into an analog linear \/f AGC equalizer and 
a digital decision feedback (DFB) equalizer. 

5) An alternate mark inversion (AMI) line code 
is used because of its simplicity. 

6) Echo cancellation and DFB equalization are 
performed simultaneously with a common 
D/A converter to reduce the number of 
gates. 

Table | lists the main specifications of the 

EC LSI. It should be noted that two line bit 

rates are provided, one for (2B +D) channel 
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Fig. 4—Timing diagram for timing extraction. 
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Fig. 5—An example frame format for 160 kbit/s line 
transmission. 


transmission and the other for (B + D) channel 
transmission. The latter is designed to offer 
a digital link on a longer cable, although the 
channel capacity is limited. 


4. Frame format 

Since the echo cancelling method requires 
very stable timing clocks to perform highly 
precise echo cancellation, a specially arranged 
frame format was adopted to achieve stable 
timing extraction''). The frame is constructed 
with frame/timing pulses (FP/TP) which contain 
an 1/0 alternating bit (M), control/status bits 
(CO/ST), information bits (1) and a de balancing 
bit (BB). Figure 4 is a detailed timing diagram 
for timing extraction at the NT. Frame pulses of 
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Fig. 6—Block diagram of echo canceller LSI. 


the transmitted frame, consisting of O pulses 
followed by a 1 pulse, are synchronized with 
those of the received frame pulses. The Os 
cover the FP/TP reception period of the received 
frame. A 14-bit blank period precedes FP to 
avoid the influence of the echo tail of the 
previous frame. With this frame format, the 
timing extraction circuit is free from the 
influence of echo signals and stable timing 
extraction can be obtained. 

However, this kind of frame arrangement is 
not necessary at the ET in the switching office. 
Because the ET has a master clock which 
governs the complete systems, it is sufficient to 
shift the phase of the master clock to attain the 
clock for the decision and the echo cancellation 
in accordance with the round-trip delay of the 
subscriber loop. Owing to line equalization and 
echo cancellation in the analog domain, a suf- 
ficient marginal eye opening on the time axis 
makes the system resilient to phase shift due to 
drifting of the round-trip delay. The adjustment 
is done during the initial training period using 
a DPLL circuit. 

The line bit rate is set depending on the 
frame repetition cycle and the capacity of 
CO/ST. The authors used a 160 kbit/s line bit 
rate, whose frame structure is shown in Fig. 5. 
This contains 144 kbit/s 2B +D channels of the 
CCITT 1-430 standard format and a 4 kbit/s 
CO/ST channel. The frame repetition cycle is 
2ms. For the 96 kbit/s line bit rate, which is 
expected to enable a longer reach, the infor- 
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Fig. 7—Admittance characteristics of balancing networks. 


mation capacity decreases to 80 kbit/s for the 
B+D channels and the capacity of CO/ST can 
be set to 7 kbit/s. 


5. Circuit configuration 

Figure 6 is a block diagram of the EC LSI. 
The LSI provides functions such as unipolar to 
bipolar (AMI code) signal conversion in the 
transmitter, and echo cancellation and the 3R 
functions (reshaping, retiming and regenerating) 
in the receiver. 

Data to be transmitted is framed and 
converted to bipolar pulses (U/B) and sent to 
the line driver (DRIV). The echo leakage is 
suppressed by the hybrid transformer (HYB). 
The automatically controlled balancing network 
circuit (BN) reduces echoes as much as possi- 
ble!!). The two-stage echo canceller consists of 
a first-stage echo canceller (EC 1) and a second- 
stage echo canceller (EC ay"), The Vf AGC 
equalizer /f AGC EQL) compensates for the 
loss characteristics of the subscriber loop. The 
decision feedback equalizer (DFEQL) com- 
pensates for residual intersymbol interference 
and the echo caused by bridged taps!” . Timing 
extraction is achieved using a newly developed 
digital phase locked loop circuit (DPLL)!*). In 
the following sections, the key techniques are 
described in detail. 


5.1 Automatically controlled balancing network 
circuit 
The purpose of this circuit is to suppress the 
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Fig. 8—Echo suppressing characteristics. 


echo leakage from the transmitter to the receiver 
by matching the line impedance and balancing 
network impedance as closely as possible. 

It was confirmed experimentally that eight 
impedance’ characteristics were enough to 
suppress the peak amplitude of the echo up to 
over 30 dB for the wide range of the line con- 
figurations. Figure 7 shows the admittance 
characteristics of eight balancing networks. The 
echo suppressing characteristics by the eight 
balancing network circuits for various line 
configurations are shown in Fig. 8. As can be 
seen in this figure, an improvement of more than 
18 dB over a simple resistor termination is 
obtained. 

Figure 9 shows the structure of the auto- 
matically controlled balancing network circuit. 
Taking account of manufacturing fluctuation in 
the CMOS process, two resistor networks, each 
consisting of eight resistors, were prepared on 
the LSI chip to approximate various line 
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Fig. 9—Block diagram of automatic balancing network. 
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Fig. 10—Echo canceller configurations. 


conditions. The optimum combination of two 
recistors, one from each network, is chosen by 
detecting the echo peak during the initial train- 
ing procedure. The convergence algorithm is as 
follows. The reference level of the comparator 
(COMP) is first switched to the lowest level, V 1. 
The echo peak is compared to the reference level 
each time a combination of two resistors, one 
from each network, is selected. When the echo 
peak is lower than the reference level, that 
combination is chosen. If the echo peak exceeds 
the reference level for all combinations of two 
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resistors, the next higher level, V 2, is chosen as 
the reference level. The procedure is repeated 
until an echo peak lower than the reference level 
is detected. Convergence is quick because the 
comparison need be made only 64 times for 
each reference level. Only one capacitor is 
necessary outside the LSI. 


5.2 Two-stage echo canceller 
Figure 10-a) shows a typical hybrid transmis- 

sion system configuration with an echo can- 

celler. In this configuration, the echo canceller 
suffers from the following problems. 

1) High-precision amplitude and dense time 
axis echo cancellation are required. 

2) Even if a satisfactory echo cancellation is 
obtained at the sampling point, a residual 
echo may exist away from the sampling 
point. This will cause unacceptable eye 
closure at the decision point due to the 
filtering characteristics of the /f equalizer. 
The second problem can be solved by the 

detecting error signal at the output of the Vf 
equalizer, shown in Fig. 10-b), for adaptation of 
the echo canceller. This configuration, however, 
introduces the problem of instability or the need 
for a very sophisticated algorithm for adap- 
tation, since each tap coefficient for echo 
cancellation becomes correlated due to the 
filtering of the /f equalizer. These problems can 
be solved by using an echo canceller with a two- 
stage structure, which was introduced in the 
authors’ system. 

Figure 11 is a block diagram of the two-stage 
echo canceller. Both the first stage echo 
canceller (EC 1) and the second stage echo 
canceller (EC 2) use a table look-up method. 
EC 1 operates on a high-speed sampling clock 
(four times the baud-rate) with a coarse 
quantization step size which aims to prevent 
signal saturation in the Vf AGC EOL, EC 2 
compensates for the residual echo due to the 
coarse quantization step size of the EC | and the 
filtering effect of the Vf AGC EQL at the 
decision points. EC 2 operates on a baud-rate 
sampling clock with a fine quantization step 
size. 

Considering the various line configurations 
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Fig. 11—Block diagram of two-stage echo canceller. 


including mixed gauges and bridged taps, EC 1 
covers 5 baud-rate intervals with 20 taps and 
EC 2 covers 8 baud-rate intervals with 8 taps, 
the first five of which overlap with the EC 1. 

By adopting the two-stage structure, it was 
found that 7-bit precision was sufficient for the 
D/A converter of EC 1 and 8-bit precision for 
EC 2, one bit being for overflow protection 
since this D/A converter has to represent the 
added value of the echo replica and DFEQL 
signal, 

After the initial convergence using the 
training procedure, only EC2 enters the 
adaptive operation mode. In this mode, the tap 
adaptation is carried out when the all the 0 
patterns are detected in the received data train. 
The echo replica is updated if the error signal is 
detected having the same polarity’ eight 
consecutive times for each corresponding 
transmitted pulse pattern. This procedure 
enables the stable adaptation of EC 2 and the 
echo replica catches up with the characteristic 
changes of the transmission line with a long time 
constant. 


5.3 Line equalizers 

Two types of line equalizers are built into 
the EC LSI chip. One is the ,/f AGC EQL which 
compensates for subscriber loop loss up to 
50 dB at the Nyquist frequency. The other is the 
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Fig. 12—Block diagram of VV f AGC equalizer. 


DFEQL which compensates for the residual 
intersymbol interference and the echo caused by 
bridged taps!*"!5), 

Figure 12 is a block diagram of the Jf AGC 
EQL. It is constructed using analog circuits 
consisting of two variable-gain flat amplifiers 
(EQL 2, EQL 3), a high-pass filter with variable 
cutoff frequency (EQL1), a low-pass filter 
LPF), a roll-off filter (ROLL OFF FIL), a switch 
decoder (SW DEC) and an automatic gain 
control circuit (AGC). Two filters, LPF and 
ROLL OFF FIL, located at the front end, 
suppress the out-band-signal components and 
release the signal from the amplitude saturation 
at the high-pass filter section. The equalizer 
EQL 1 compensates for the \/f loss character- 
istics. This is also an effective means of shorten- 
ing the echo tail. The Jf AGC EQL is designed 
to be used with two transmission bit rates 
(160 kbit/s or 96 kbit/s). The optimum com- 
binations of variable flat gain of EQL2 and 
slope gain of EQL 1 corresponding to the loop 
loss characteristics are memorized in the on-chip 
ROM. The desired transmission bit rate can be 
chosen by pin-programming the LSI. 

Figure 13 is a block diagram of the DFEQL 
with two taps. The DFEQL operates ona baud- 
rate sampling clock. Tap coefficients are 
automatically trained during the first training 
period. After the convergence of the tap 
coefficients it is sufficient to update them with 
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Fig. 13—Block diagram of decision feedback equalizer. 


a very long time constant, so the adaptation of 
the tap coefficients is carried out when the 
isolated pulse pattern (0100) in the received 
data train and the all O pattern in the trans- 
mitted data train are detected at the same time, 
thus avoiding the interaction between EC 2 and 
DFEQL. The DFEQL shares an 8-bit D/A 
converter with the EC 2. 


5.4 Timing extraction circuit 

A new DPLL circuit was developed to 
improve the jitter characteristics of the re- 
generated clock. 

Jitter has the following causes: 
1) Pattern jitter 

Due to intersymbol interference. 
2) Idling jitter 

Due to the control mechanism of the DPLL, 
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which is estimated to be the ratio of the baud 
rate to the master clock frequency. 
3) Free-running jitter 

Due to the frequency offset between the 
master clock frequency of the ET and that of 
the NT. 

In this method, the clock signal is extracted 
only from the fixed pulse pattern (FP/TP). 
Therefore, the pattern jitter due to intersymbol 
interference is negligible. Since the master clock 
frequency is set to 15.36 MHz for a baud-rate of 
160 kbit/s, the idling jitter width is estimated to 
be between one and two percent. The main jitter 
factor is the free-running jitter. The authors’ 
proposed DPLL control method achieves stable 
timing extraction even when the frequency 
offset is more than +100 ppm between the ET 
and the NT. This is realized by measuring the 
frequency offset during the initial training 
period, and assigning the insertion/drop control 
timing whose frequency is a function of the 
measured frequency offset. The timing phase 
jumps caused by the insertion/drop control of 
the DPLL may cause a slight degradation of the 
echo canceller performance, but it is found that 
the total system performance is barely influ- 
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enced by such timing phase jumps. 

Figure 14 is a schematic diagram of the 
DPLL training procedure. 

1) Initial convergence period (f, ) 

After the line equalizer converges, the DPLL 
is phase-locked to the pulses received from the 
far end. 

2) Free-running period (f2 ) 

After the DPLL is phase-locked, the input of 
the received pulses to the DPLL is inhibited for 
one frame period (7+). The DPLL goes into free- 
running operation during this period and the 
phase difference (6) appears due to the 
frequency offset. 


220 


3) Detection of the frequency offset (f3 ) 

After free-running, the DPLL is again phase- 
locked, and during this period the number of 
received input pulses that needed to be phase- 
locked (N) is counted, which can be regarded as 
the frequency offset between the ET and the NT 
per frame. The direction of control (drop/ 
insertion) (P) is also memorized. 

4) Compulsory control of the DPLL (fq) 

Based on the results of P and N obtained 
with the above procedure, the DPLL must be 
subsequently controlled N times for each frame 
during the free-running period. As a result, the 
free-running jitter is suppressed. During the 
timing extraction period using the FP/TP pulses, 
the DPLL operates normally. 

Figure 15 is a block diagram of the new 
DPLL circuit. The following circuits have been 
added to a standard DPLL circuit. The frame 
detector (FRAME DETECT) sets the frame 
period before frame synchronization is ac- 
complished. The input controller (INPUT 
CONT) allows only timing pulses (TP) and 
compulsory pulses to pass through to the DPLL. 
The phase detector (PHASE DETECT) detects 
the convergence of the DPLL and_ also 
memorizes the direction of the drop/insertion 
control. The offset counter (OQFFSET COUNT) 
detects the frequency offset. Based on the 
output of the OFFSET COUNT, the frame 
counter (FRAME COUNT) sets the timing of 
compulsory control of the DPLL by dividing the 
frame period into N equal intervals. 


5.5 Activation procedure 

To ensure quick and stable convergence, 
an initial training procedure for each functional 
block, such as BN, ,/f AGC EQL, DFEQL, 
DPLL and EC, is provided. Figure 16 shows the 
initial training procedure for the EC LSI when 
an activation request arises from the ET side. 

The procedure starts when the signal power 
is detected. The equalizers,,/f AGC EQL, DPLL 
and DFEQL, are trained in that order using the 
training pattern (1/4-bit pattern: repeated 
pattern of a 1 pulse followed by three 0 pulses). 
At the same time the optimal balancing network 
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Fig. 17—Microphotograph of the EC chip. 


characteristic is chosen. Following this sequence, 
EC 1| and EC 2 are trained with the training time 
for EC 2 being a little longer than that for EC 1 
using the special training pattern which contains 
every combination of 8-bit patterns. To keep the 
timing clock phase correct, the _ specially 
arranged frame format is used in the training 
period. Four timers, ¢; to f4, are used to control 
the sequence, and are set up by a built-in MXP 
timer. 


6. ECLSI 

All the functional blocks mentioned above 
were implemented on the single-chip EC LSI. 
Figure 17 is a microphotograph of the EC chip. 
The LSI was designed using an analog/digital 
CMOS process!°, Table 2 also lists the 
specifications of the EC chip. 

The 8-mm x 9-mm chip was designed includ- 
ing 4500 basic cells, a ROM (32 W x 8 bits) and 
two RAMs (512 W x 11 bits and 256 W x 7 bits) 
in the digital section with operational amplifiers, 
comparators, capacitors, resistors, and two D/A 
converters in the analog section. The block 
layout was done by separat*ng both sections 
with power supply lines and a test circuit to 
avoid interference between the analog and 
digital parts. 

Data transfer between the analog and digital 
sections is through the test switching circuit, 
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Table 2. Chip specifications 
Kem 


Physical 
Channel length 2.3 um 
Chip size 8 mm x 9 mm 
Package RIT-64 
Organization | 
Randam gate scale | 4 500 basic cells 
Operational 12 
amplifier 
Comparator i 
ROM 1 (32 W x 8 bits) 
RAM | 2(512 W x 11 bits) 


(256 Wx 7 bits) 


D/A converter 2 (7 bits, 8 bits) 


Others Resistor, and capacitor 
: | oe dete 
Operational 
Supply voltage > V. 


Power dissipation 150 mW 


which enables these sections to be tested 
independently. When the digital section is 
tested, all I/O signals are routed to the tester 
which generates pseudo I/O signals. During the 
dc test of the analog section, digital control data 
is directly applied from outside through the test 
switching circuit and every amplifier, com- 
parator, D/A converter, and Vf EQL are tested. 
The EC LSI also features an initial training 
control circuit which generates the initial train- 
ing sequence of each functional block by 
receiving training patterns from the multiplexing 
processor LSI. 


7. Performance characteristics 

The total DSL system was installed and 
evaluated in terms of eye openings, eye dia- 
grams, jitter characteristics, sinusoidal crosstalk 
margin, and activation time. It was tuned to 
a line bit rate of 160 kbit/s. 

To evaluate the performance of all equal- 
izers, eye Openings were measured by changing 
the lengths of the 0.5 mm diameter cable. The 
results are shown in Fig. 18. The bottom line is 
without echo cancellation. For the middle line, 
only EC | is operating. The top line is when the 
two stage echo canceller operates. It can be seen 
that impaired eye openings due to residual 
echoes are restored by EC 2, especially for long 
cable lengths, and consequently good eye 
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Fig. 20—Jitter characteristics. 


openings of more than 70 percent can be 
obtained. 

Figure 19 shows example eye diagrams at 
the output of the DFEQL of the NT side. The 
diagram in Fig. 19-a) was obtained by connect- 
ing the NT and the ET through a 6-km cable of 
0.5-mm diameter with an attenuation of 45 dB 
at 80 kHz. The diagram in Fig. 19-b) represents 
the case when two bridged taps are added to a 
4.5-km cable, with a total attenuation of 47 dB 
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0.5 mm dia. 6.0 km 


Loss: 45 dB at 80 kHz 


a) Without bridged taps 


0.5 km 
Pails 
0.5mm dia. 4.5 km 
0.5mm dia., 0.5 km x 2 
Loss: 47dB at 80 kHz 


b) With bridged taps 


Fig. 19—Eye diagrams. 


at 80 kHz. In both cases clear eye openings are 
obtained at the decision point. 

The jitter characteristics of the extracted 
clock were measured with different frequency 
offsets between the NT and the ET. The results 
are shown in Fig. 20. This was measured without 
a high-pass filter, which is a more severe test 
condition for jitter measurement, but still meets 
the five percent peak to peak requirement. 

To evaluate the noise margin, the error rate 
was measured in the presence of sine wave 
interference with an amplitude X. The results 
are shown in Fig. 21. In this measurement, the 
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Subscriber line: 0.5mm dia. 


10-4 


Error rate 


107° 


1077 


Length (km) 


S/X (dB) 


Fig. 21 —Error-rate characteristics by equalizing 
waveform marginal checking. 


Fig. 22—Activation time. 


cable lengths were changed from | km to 6 km 
and frequency offset was more than 30 ppm. 
A good S/X ratio of less than 12.7 dB at an error 
rate of 10°7 was obtained for a variety of loop 
lengths. 

Figure 22 shows the measured activation 
time of the EC LSI. LTRD-N, ECRD-N, and 
SYNC-N indicate the convergence of the line 
equalizers including DPLL, that of EC, and the 
accomplishment of frame synchronization, 
respectively, at the NT side, while LTRD-E, 
ECRD-E, and SYNC-E indicate those of the ET 
side. The total activation time from start-up to 
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the establishment of bidirectional error free 
transmission was less than 220 ms. 


8. Conclusion 

This paper described an echo canceller LSI 
for ISDN 
A specially arranged time-compressed frame 
format combined with the use of a newly 


subscriber loop transmission. 


developed DPLL circuit provides excellent 
timing extraction performance. The approach 
requires only 8 bits precision for the D/A 
converter, which made the LSI implementation 
easier. As the EC LSI has been developed and 
combined with a processor LSI for multiplexing 
and circuit termination, it can perform basic 
DSL equipment functions. Experiments were 
carried out using a line transmission rate of 
160 kbit/s and the AMI code. As a result, more 
than 6-km transmission on 0.5-mm twisted-pair 
cable was found to be possible. 
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Considering high-power transmission of high-capacity digital radio system such as a 
256 QAM, this paper describes a method for determining the conditions under which a new 
digital system can be installed without disturbing the already existing analog FM systems. 
The ratio of the FM signal to interference level is calculated as a function of the channel 


separation between the FM and the digital systems, using various parameters such as bit 
rate, rolloff factor, and modulation level of the digital system. This paper also shows that 
spectrum shaping of the QAM system by cosine rolloff filter results in a more rapid decrease 
in the interference level for increase of the channel separation than that of the conventional 
PSK system using a 5-section Butterworth filter with BT of 1.2 as a transmit filter. 


1. Introduction 

Improvement of spectrum efficiency has 
been the most important task in field of micro- 
wave digital radio systems in for the last ten 
years. To this end, 16-QAM and 64-QAM digital 
radio systems have already been used in the 
field)? Even 256-QAM systems have been 
developed and are now field-tested in Japan by 
NIT? 

However, a system with a higher level of 
modulation requires higher transmission power. 
For example, a 256-QAM system requires 
a transmission power eight times that needed 
by a 16-QAM system when the repeater spacing 
and bit rate are the same. The increased trans- 
mission power means that the system can 
strongly interfere with adjacent systems. Thus, 
to install a new multi-level QAM system, it is 
necessary to accurately estimate the conditions 
under which communication quality of sur- 
rounding systems are not degraded. 

Since many analog FM systems have been 
installed nearly everywhere, this paper estimates 
degradation of their communication quality 
caused by installation of the multi-level QAM 
system. Though interference between the analog 


FUJITSU Sci. Tech. J., 24, 3, pp. 225-234 (September 1988) 


FM systems and digital PSK systems has been 
estimated”’®), spectral shaping of the multi-level 
QAM system differs from that of the conven- 
tional PSK system cited in reference 7. As this 
paper will indicate, the difference between the 
PSK and the QAM system is considerable. 
Thus, spectral shaping using a cosine rolloff 
filter is assumed in this paper. 

Spectrum of the analog FM signal is required 
in the estimation mentioned above. A consider- 
able number of papers”! were published in 
early 1970 to give analytic expression of the FM 
spectrum. Of these papers, reference 9 has given 
a rigorous autocorrelation function of the FM 
signal. Authors have expanded the autocorrela- 
tion function into a Taylor series to give an 
analytic expression of the FM spectrum. How- 
ever this expansion has made mathematical 
procedure rather complex. On the other hand, 
computer simulations”’® of the FM spectrum 
have also been reported, and computer simula- 
tion has a simpler mathematical procedure than 
those of references 9 to 11. However, computer 
simulations are time consuming. This paper 
adopts an intermediate method between the 
computer simulation and the analytic approach: 
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Equation (12) of reference9 is numerically 
estimated to simplify the mathematical 
procedure. Since the computation time of 
computers has substantially decreased recently, 
the FACOM M-380 high-speed digital computer 
is able to give the FM spectrum with 400 
sampling points in frequency within just 4 s. 

This paper is composed of 5 chapters. 
Chapter 1 is the introduction. Chapter 2 de- 
scribes the calculation method of interference 
from the digital QAM system to the analog FM 
system and has three sections. Section 2.1 gives 
the autoccorrelation function of the analog FM 
signal, Sec. 2.2 gives the spectrum of the FM 
signal, and Sec. 2.3 gives the signal to inter- 
ference noise ratio (SIR) of each FM channel. 
The calculated results of the interference are 
given in Chap.3. Chapter 3 has two sections. 
Section 3.1 gives the calculated FM spectra, 
and Sec. 3.2 shows the dependence of signal to 
interference noise ratio on channel separation 
between digital QAM and FM systems. Com- 
parison of the dependence for various system 
parameters such as bit rate, is also given. After 
the conclusion in Chap. 4, an appendix is given 
in Chap.5 to supplement the mathematical 
procedure of Sec. 2.1. 


2. Calculation method of interference form 
digital QAM system to analog FM system 
2.1 Autocorrelation function of the analog FM 
signal 
As described in the introduction, the esti- 
mation of interference requires spectrum of the 
analog FM signal. Since the spectrum is given 
by an inverse Fourier transformation of the 
autocorrelation function, its analytic expression 
given by Rowe et al.” is briefly reviewed in 
this section. 
Figure 1 is a block diagram of an analog FM 
modulator. The data source of this figure 
gives a signal whose statistical properties are the 


Signal 


Fig. 1—Block diagram of FM modulator. 
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same as those of a white Gaussian random 
variable. Since a low-pass filter (LPF) eliminates 
the higher frequency components, the output 
signal of the LPF accurately represents the 
actual baseband signal. 

When pre-emphasis!” is applied to the base- 
band signal, the amplitude transfer function of 
the LPF is given by the following equation. 


(10)'*[ { 5.25 @,w)? + @, — 
— w?)? } / 1 5.25(w,w)? + 

+159 eo moe ¥ 1’ 

: (w, <|w |< Wmax) 


0(|w |<, or | w|]> max), 


H (w)= 


where Wmax and w, are the maximum and mini- 
mum angular frequency of the baseband signal, 
respectively. The resonant angular frequency w, 
is related to the maximum baseband frequency 
by the following equation: 

Gy = 1.25 Odin ax s 

When pre-emphasis is not applied, H (w) is 
unity for frequencies lower than the maximum 
baseband frequency. The output signal of the 
LPF is fed to a voltage controlled oscillator 
(VCO) whose oscillation frequency deviates 
from free running frequency proportionally 
to the input signal voltage. 

The output signal vy, (¢) of the LPF is 
related to the output signal of the data source 
vo (t ) as follows: 


+ 00 

n(t) =). A(t,u)r(u)du, —.. (3) 
where A(t,u) is the impulse response of the 
LPF and is given by 


h(t,u) =f" H(w) exp} jolt — u)}- 
A | eta 


AAGee wasn aemeoee (4) 


Since v; (f) is the input signal of the VCO, 
its output signal vz (¢) is expressed as 


v2 (t) = Voexp | jk J‘) vi (u)adu }, 


where k is the constant representing frequency 
deviation of the VCO. In this expression, the 
carrier component exp (jw,t) is omitted for 
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simplicity of notation. The autocorrelation 
function R (7) of v2 (ft) is given from its de- 


finition’”, 
R @) = { vex(t+7) 2 *(t) | 
= Vo [exp } ze vy; (u)du |), 
sas eS ov we le Se (6) 
where | ..._ | means the ensemble average of 


all possible values of v2(t) and * means 
complex conjugate. Substituting Equation (3) 
into Equation (6) gives 


R @) = Vor Lexp | jk |” v9 (u)- 
“SF (ir. pa? I, Beek Ge 


where the order of integration is changed and 
f(t, 7, ) is given by 


fie = | OA” cians (8) 


The ensemble average of Equation (7) is 
calculated by considering white Gaussian nature 
of vo (t) and gives the autocorrelation func- 
tion”, 

R (tr) = Voz exp [ Ss Ai | Hw) |? 

~W m ax 
- (1 — coswr)/w* x dw J, 5a (3) 
where p is the baseband signal power within 
one hertz. Derivation of Equation (9) from 
Equation (7) is given in the appendix. The 
argument of the exponential function in 
Equation (9) is directly estimated with numeri- 


cal integration. 


2.2 Power spectrum of analog FM signal 

As is well known, the power spectrum is 
obtained by Fourier transformation of the 
autocorrelation function’. Thus, the power 
spectrum density P (w) of the FM signal is 


ae ifr eS : 
P w)=Vo i exp (wT) - 
exp [=p |“ ™™ | way}. 
~W max 


a cosAT)/A7dd ] dt. «(TOY 


The total transmission power of the FM 
signal is obtained by integrating Equation (10) 
with respect to w, 


ae 2 
[ P @)dw=27Vo", 


where the following integral expression of the 
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Dirac delta function is used in the derivation of 
Equation (11): 
+ co 
{ exp (jwT)dw=2756 (rT). 


Thus, the normalized FM spectrum Ppy (w) 
is obtained by dividing equation (10) by 2nVo?: 


sare h2) 


Prem (w) = 1/20 J exp (jw): 


-exp [ kp oe H e.8 Tae 


- (1 — cosAr)/A*dA] dr. ...(13) 


Since the carrier component is eliminated, 
the angular frequency represents the deviation 
from the carrier. 

In experiments, the power given to each 
channel is usually used instead of spectral power 
density. It may be reasonable to give the FM 
spectrum which can be compared with the 
experimental model, although this comparison 
is not made here. The power for a channel is 
obtained by integrating Equation (13) within 
the channel bandwidth B: 


P. @)=B {exp (jwr):{sin (nBr)/ "Br - 
-exp [ —k*p shies 1H (dr) }?- 
-(1 — cos Ar)/A?7 dA] dr. .. (14) 


The power spectrum given by Equation (14) 
depends on parameters k and p. Since pre- 
emphasis does not influence the total baseband 
power, the baseband power density p is given by 
the total baseband power Pj, and bandwidth, 


The modulation sensitivity k is usually 
expressed in terms of the frequency deviation fy 
corresponding to unit input power Po of the FM 
modulator: 

2nfy 
= (2Py 12 


2.3 Interference from digital QAM system to 
analog FM system 
This section discusses the quality of de- 
modulated FM signal. A signal to noise ratio 
(SNR) high enough to give good communication 
quality is assumed when there is no interference 
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from the digital QAM system. Considering the 
interference from the digital system as noise, 
signal to interference noise ratio (SIR) is defined 
as a power ratio of the FM wave to the interfer- 
ing wave. An equation giving the SIR is already 


given by Stojanovic et ah 58. 


S/T =(k? Poy | max )1(277/w)? . 
‘{H @)}?/ 
(°° P. @—A) Pog AAA}, 


where Ppg (w) is the power spectrum of inter- 
fering wave. The transmission power spectrum 
of the digital QAM signal is the same as the 
output power spectrum of transmit filter for 
white input spectrum. In this paper, it is 
assumed that filters are equally shared between 
the transmitter and receiver. In other words, 
amplitude transfer function of the transmit 
filter is assumed to be the square root of a 
cosine rolloff filter. The power spectrum of the 
interfering signal is normalized so that integra- 
tion of the spectrum over frequency is unity, 
1:6:, 


Pog &)= 
T/(2n) (\f | /fo <1 —@) 
—— [1 +xvosr | f fo =) } 
4n 2afo 


(1 —a<f/fp <1 +a) 
OCIFI/fo >1 +a) 


where J is the symbol period, a is the rolloff 
factor and fo is the Nyquist frequency | fo = 
1/(2T) |. Since normalization eliminates infor- 
mation of strength of the interference, the 
strength is considered in parameter 7 of 
Equation (17), which is the ratio of the total 
desired power to the total undesired power. 


3. Calculated results of interference from QAM 
system to FM system 
3.1 Spectra of FM transmitters 
Figure 2 shows the power spectra of FM 
960-channel signals. The solid curve shows 
the spectrum with pre-emphasis and the dashed 
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FM 960-channel 


Spectral power (dB) 


5 =I = 5 0) 5 10 15 
Frequency (MHz) 


Fig. 2—Spectra of FM 960-channel transmitter. 
Frequency deviation of 200 kHz per 0 dBm 
input is assumed. Baseband signal is assumed 
to have the frequency range from 60 kHz to 
4.188 MHz. Average power of —15 dBm is 
assumed for each baseband channel. 


curve shows that without pre-emphasis. In this 
figure, a frequency deviation of 200 kHz per 
QdBm input power of the FM modulator is 
assumed. The input power level of the FM 
modulator is —15 dBm per channel. The mini- 
mum baseband frequency is 60 kHz and the 
maximum baseband frequency is 4.188 MHz. 
The horizontal axis represents the relative fre- 
quency to the carrier and the vertical axis 
represents the normalized power spectrum. 
Parameter B of Equation (14) is assumed to be 
4 kHz. 

Figure 3 shows the power spectra of FM 
1 800-channel signals. A frequency deviation of 
140 kHz is assumed for 0 dBm input power of 
the FM modulator. The minimum baseband 
frequency is 312 kHz and the maximum base- 
band frequency is 8.204MHz. The ratio of 
the minimum baseband frequency to the maxi- 
mum is larger in the case of the | 800-channel 
than that of the 960 channel. The two dips 
on both sides of the carrier correspond to 
an absence of baseband signal near DC. These 
dips are also observed in the calculated spectrum 
shown in reference 11. A comparison of Figs. 2 
and 3 shows that larger energy is concentrated 
on the carrier for the 1 800-channel FM signal 
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Fig. 3—Spectra of FM 1 800-channel transmitter. 
Frequency deviation of 140 kHz per 0 dBm 
input is assumed. Baseband signal is assumed 
to have a frequency range from 312 kHz to 
8.204 MHz. 


than the 960-channel FM signal. 


3.2 Degradation of signal to noise ratio of FM 
signal caused by interference from digital 
QAM system 
This section discusses the quality of the FM 

signal. SIR of the FM signal calculated by 
Equation (17) includes 7, which is the power 
ratio of the desired FM signal to the undesired 
digital QAM signal (DUR). The ratio differs for 
each combination of the two systems. Since the 
ratio can easily by calculated, it may be con- 
venient to subtract DUR from SIR. 

The purpose for estimating the SIR is to 
determine the conditions under which all FM 
channels communicate with little interference. 
The first step in the estimation is to find the 
channel with the worst SIR. Figure 4 shows 
the SIR of the FM 1 800 channel signals. The 
interference for the solid curve is given by 
a 64-QAM system, and that for the dashed 
curves is given by a 16-QAM system. In both 
cases, a bit rate of 90 Mbit/s is assumed. Since 
the baud rate is obtained by dividing the bit rate 
by information transmitted per symbol period, 
the baud rates of the 16-QAM and 64-QAM 
systems are 22.5 Mbaud and 15 Mbaud, respec- 
tively. In this figure, the carrier of the FM 
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16-QAM 


S/I-D/ U (dB) 


64-QAM 


~ 
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Fig. 4—Signal to interference noise ratio (SIR) of each 
channel. Both 16-QAM and 64-QAM digital 
systems have a bit rate of 90 Mbit/s and rolloff 
factor of 50 percent. The carrier of digital 
system is assumed to be located at the carrier of 
the FM 1 800-channel system. 


system coincides with that of the digital system. 
A rolloff factor of 50 percent (~@=0.5) is 
assumed for the digital system. As can be 
seen from the figure, the channel with the 
worst SIR differs from the top channel. This 
difference is also shown in reference 16 for 
interference between the FM systems. 

Next, the SIR of the worst channel is 
estimated as a function of the channel separa- 
tion between the digital and the FM system. 
The SIR is a function of the channel separation 
and depends on various system parameters 
such as channel number of the FM system 
and bit rate of the digital system. First, the 
dependence on the parameters of the digital 
system is examined, and then the dependence 
on the FM system will be estimated. The FM 
960 channel signal is considered in Figs. 5 to 8. 

As the first example, a comparison of the 
difference caused by spectrum shaping is shown 
in Fig. 5. In this figure, the rolloff factor is 
assumed to be 0.2 (20 percent). A 5-section 
Butterworth filter with BT of 1.2 is assumed for 
the dashed curve in accordance with that of 
reference 7. The same baud rate of 22.5 Mbaud 
is assumed to give almost the same 6 dB band- 
width. As can be seen from the figure, the dif- 
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Fig. 5—Comparison of the worst channel SIR for 
different interference sources. Modulation is 
4-PSK and the baud rate is 22.5 Mbaud. Solid 
curve corresponds to the spectrum shaping by 
a rolloff filter with a rolloff factor of 0.2 and 
dashed curve corresponds to that by a 5-section 
Butterworth filter with a BT product of 1.2. 
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Fig. 6—Dependence of the worst channel SIR on rolloff 
factor of digital system. 


ference between the two curves cannot be 
neglected. The difference is caused by the 
difference in spectrum shaping, as indicated 
in the introduction. 

Since spectrum shaping depends on the roll- 
off factor, the dependence of the SIR of the 
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Fig. 7—Variation of the worst channel SIR for various 
modulation level of digital system. The same bit 
rate of 90 Mbit/s is assumed for all modulation 
levels from 4-PSK to 256-QAM. In the 4-PSK 
system, a 5-section Butterworth filter with a BT 
of 1.2 is assumed as spectrum shaping filter. 


worst channel on the channel separation is 
estimated for 20 percent and 50 percent of 
rolloff factor. The two values can be regarded 
as upper and lower limits for the following 
reason. In the high capacity digital radio system, 
a rolloff factor larger than 50 percent is not 
suitable in practical use because of its lower 
spectral efficiency. However, a filter with a 
lower rolloff factor requires more accurate 
design and 20 percent can be considered as the 
lower limit attained by the present state of the 
art. As can be seen from the Fig. 6, the differ- 
ence of the rolloff factor between 20 percent 
and 50 percent is much smaller than that 
between the Butterworth and rolloff. Since the 
difference of the rolloff factor between 
50 percent and 20 percent is not large, the roll- 
off factor of 50 percent is used in the following 
calculation. 

Next, the variation of the worst SIR by 
modulation level of the QAM system will 
be discussed. Hitherto, comparisons have been 
done under the same baud rate because our 
attention has been focused on the effect caused 
by difference in spectrum shaping. However, 
a higher level QAM system can transmit the 


FUJITSU Sci. Tech. J., 24, 3, (September 1988) 


Y. Daido and H. Nakamura: Interference from Digital QAM System to Analog FM System 


| 
90 Mbit/s || U 


H/ 


/ 


mg 2 / 


S/I-D/ U (dB) 


FM 960-channel 
256-QAM 


a=0.5 


0 10 20 30 40 50 
Channel separation (MHz) 


Fig. 8—The worst channel SIR vs. channel separation for 
various bit rates. 


information in the same bit rate using a smaller 
bandwidth. The effect caused by bandwidth 
reduction should be considered. Figure 7 
compares the worst SIR for different modula- 
tion level. The bit rate of 90 Mbit/s is assumed 
in this figure. This figure clearly shows that 
the QAM system with a higher modulation 
level can be installed with smaller channel 
separation than that with a lower modulation 
level. 

Dependence of the SIR on bit rate of the 
256 QAM system is shown in Fig.8. As ex- 
pected, larger channel separation is required 
for the system with a higher bit rate. The 
channel separation required for a_ specified 
system can be determined using this figure. 
When the channel separation is specified, this 
figure gives the maximum possible bit rate of 
the 256 QAM system. 

Figure 9 shows the difference in the SIR 
for various channel numbers of the FM system. 
The 135-Mbit/s 256-QAM system with a rolloff 
factor of 50 percent is assumed as the inter- 
ference source. The FM system with a larger 
channel number requires a larger channel separa- 
tion, as shown in the figure. 

Figures 5 to 9 determine the channel separa- 
tion well enough to maintain good communica- 
tion quality in the FM systems. 
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Fig. 9—Dependence of the worst channel SIR on channel 
number of FM system. 


The required channel separation is de- 

termined by the following procedure: 

1) Specifying the required SIR, 

2) Calculating DUR, the ratio of the trans- 
mission power of two systems, 

3) Subtracting the DUR from the required SIR 
and 

4) Reading the channel separation where SIR is 
equal to the difference obtained by item 3) 
above. 


4. Conclusion 

Spectrum of analog FM signal was estimated 
by calculating autocorrelation function through 
numerical integration. Numerical integration 
gives rigorous results with a simplified mathe- 
matical procedure. The estimation method 
gives the FM spectrum within 4s of computa- 
tion time, with 400 sampling points of fre- 
quency. 

The FM spectrum was used to estimate 


the interference from digital QAM systems to 


the FM system. Signal to interference noise 
ratio was estimated for each channel of the FM 
system. This calculation showed that the top 
channel is not always the worst channel. 
Dependence of the worst signal to inter- 
ference noise ratio on channel separation be- 
tween the FM and the QAM system was 
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calculated to determine the channel separation 
with which the digital QAM system can be 
installed without disturbing the FM system. 

These results are considered in estimating 
the feasibility of installing a digital QAM system 
in a channel adjacent to the FM system. After 
the possibility is confirmed, the method 
described in this paper gives useful information 
for selecting filters in the transmitter of the 
digital QAM system. 

Since SNR of the FM signal will be degraded 
by fading, a reasonable value for the flat fade 
margin should be considered. In addition, 
nonlinearity of high power amplifier spreads 
the spectrum of the QAM system and increases 
the interference from the digital QAM system. 
An additional margin should be assumed besides 
the flat fade margin. 

Though this paper does not describe the 
interference from the FM system to the digital 
QAM system, the FM spectrum of this paper 
can directly be used for estimating the inter- 
ference. 


Appendix 
1. Calculation of statistical average of Equa- 

tion (9) 

Equation (7) can be rewritten in the form 
of summation according to the definition of 
an integral: 

R (r) = Vo" lim [exp | jkAu = f- 

Au-0 


n=-2 


-(t,7T,nAu)vo(nAu)}]. 


Since the argument of the exponential func- 
tion is the summation, Equation (A-1) can be 
expressed as a product of the exponential 
functions. Input voltage v9 (ft) of the low-pass 
filter in Fig. 1 can be regarded as a white 
Gaussian random variable whose values sampled 
at different instatnts are statistically in- 
dependent. The average of the product of 
statistically independent variables is equal to 
the product of the averages!” 


Thus, Equation (A-1) becomes 
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R(r) = Vo lim) TL [exp | jk Auf: 
Au>0 


n=-0 


-(t,7,nAu)vgo(ndu) |]. 


Here, the difference between discretely 
sampled values of vg(t) and the original con- 
tinuous variable should be considered. Since 
uniform sampling is used in Equation (A-2), 
sampled values of v9(t) have spectral com- 
ponents between —7/Au and m/ Au. Thus, the 
total power of the sampled values is equal to 
p(2m7/Au) where p is the spectral power 
density of the original vp (f), 


[| v(nAu) |? 1] = p(QQa/Au). 


The average power of the sampled value is 
equal to its variance. Since the Gaussian nature 
of variable v9(t) is assumed, the probability 
density function P (vp ) of the sampled value is 


P(%)= (Au/4mp)'? exp (— Auvo’ /4mp). 


This probability density function is used to 
calculate the ensemble average of Equa- 
tion (A-2): 


<exp | jkAuf(t,7,nAu)vo(nAu) |> 
=<f oti (Au/4mp)'!? exp [ Au - 


‘| ikf(t,7,nAu)vo — vo /4ap}dvo|>. 


Equation (A-6) can be rearranged as 


<exp} jkAuf(t,7,nAu)vp (ndu) |> 
=exp [—mpAu | kf(t,rt,nAu) |*]-1, 


where, 
l= (Au/4np)'? i exp[—Au- 


| ¥9 —j2mpkf(t,t,nAu)|} lay 
4np ae 


Since the integrand of Equation (A-7) 
has no singular point, integration can be carried 
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out along a contour on which the argument 
of exponential function is real. This contour 
integral is equal to the integration of Equa- 
tion (A-4) with respect to v9 over the whole 
range. Thus J of Equation (A-7) is unity. Sub- 
stitution of Equation (A-6) into (A-2) gives 

R (rt) = Vo lim Tl exp[—apAu- 

Au>0 


| ef Gt, c.nAuy 71, 2. th8) 


The product of exponential functions can 
be expressed by an exponential function whose 
argument is the sum of all arguments of factored 
exponential functions given in Equation (A-8), 
1¢.; 

R (r) = Vo’ exp [—mpk? So : 


-4 f(t,r,u)} 47dul, ...A9) 


where summation is replaced by the integral in 
accordance with the definition of an integral. 
Estimation of R (7) using Equation (A-9) is 
time consumming, as the function f(t,7,u) 
in Equation (A-9) is given as an integral ex- 
pression (see Equation (8) in the text). To 
reduce computation time, Equation (8) is 
substituted into Equation (A-9). The integral 
F(t, 7) in Equation (A-9) is 
+ 


F (t, 1) =| 7 | f(t,7,u) }?du 


t+ Fir Te 
= J, i. h(x,u): 
-h(y,u)dudxdy , ogee. CAH10) 


where the order of triple integration is changed. 
Since the range of the integral with respect to 
u extends infinitely in the positive and negative 
directions, a closed expression can be expected 
for the integral. 

Let Q(x,y) be the integral with respect 
to u, 1.€., 


Q(x,y)= ee (x, u)h (y,u)du. 


To calculate Q (x,y), Equation (4) in the text 
is substituted into Equation (A-11): 


= ‘i i fH @) HO): 


O(x,7)= 


-exp | jw (x —u)t+jA(—u)]- 
-dwdddu . 
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The order of integration is changed again 
and the integral with respect to uw is given by the 
Dirac delta function’, i.e., 


1 pte ah 
oo exp } j~(wtaA)u; du 


= 6 (wtdA). 


Substitution of Equation (A-13) into Equa- 
tion (A-1 2) gives 


1 +00 & +00 
TCAD oe [ H(w)H A)6 (w +d): 
-exp | j @xtAy) | dwdr 
1 p+ 
=5- |. H ©) Hw) - 


-exp | jw (x -—y)} dw. 


Since the function H (w) is zero in the range 
where the modules of w is larger than w,,,x , the 
range of the integral in Equation (A-14) is 
restricted to the finite range where |w| is 
smaller than Wx. Equation (A-14) is used in 
Equation (A-10) and F (tf, 7) is given as 


l + W max 
F(tt)=5-\_, Ht @) A Cw): 


\ t+T 2 
a i exp(jwx )dx: 


[Texp(—jwy) dy ldw. (A-15) 
t 


Equation (A-15) can be simplified by carry- 
ing out the integrals with respect to x and y, i.e., 


1 ¢+w ms 
F(t,1)=— JO" H @)H (a): 


1 — cos( wrt 
(a le, 
ro) 
Substitution of Equation (A-16) into Equa- 


tion (A-9) gives Equation (9) in the text. 


(A-16) 
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This paper presents a newly developed transfer method for electrophotographic printers. 
This method uses a medium such as silicone rubber to which a toner image is first transferred 


by pressure and from which the image is then retransferred and fixed onto paper by pressure 
and heat. The resulting image resolution of 1000 dots per inch is due to less toner scatter. 
This method is suitable for in-house printing and desktop publishing applications. 


1. Introduction 
The primary application of the electrophoto- 

graphic process is in xerographic copiers and 

laser printers. 

This process has seven steps: charging the 
photoconductor, exposing it to produce a latent 
electrostatic image, developing the image with 
toner, transferring the toner image to paper, 
fixing the image, cleaning the photoconductor, 
and erasing surface charges on the _ photo- 
conductor. 

There are two problems inherent in conven- 
tional electrostatic transfer’ of toner images by 
electrostatic force: 

1) The quality of the transferred image can be 
degraded by nonuniformity in the electric 
transfer field. 

2) Toner that has low electrical resistivity 
(<10!4 Qem), and which easily produces 
a sharp toner image, is difficult to transfer 
to plain paper, which leaks an electric 
charge. 

The expansion of in-house printing systems 
(IPS) and desktop publishing (DTP) has in- 
creased the need for printers that provide 
better quality imaging. Developments in electro- 
photographic technology have provided toner 
images with higher resolution, but the resolution 
of electrostatic transfer images is limited. 


The nonelectrostatic transfer method 
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we studied uses a medium that has a high 
affinity for toner and low free surface energy 
(< 40 dyn/cm). 

The transfer principle and the results of 
experiments exploring various transfer character- 
istics are explained in the following sections. 


2. Transfer principle 
Nonelectrostatic transfer involves adhesive 


Recording drum 


a 


Pressure roller 


a) Adhesive transfer 


Backup roller 
=. 


a 


Toner image” 


Transfer 
medium 


Transfer 
medium 


Toner image 


Paper 


Heat roller 


b) Fusing transfer 


Fig. 1—Transfer mechanism. 
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(first) and fusing (second) transfers. 


2.1 Adhesive transfer 

In the first transfer zone, the transfer 
medium is brought into contact with the record- 
ing drum by a pressure roller | see Fig. 1-a) | 
The transfer medium has a high affinity for 
toner, a low level of free surface energy, and 
is heat resistant. Materials having these 
properties include silicone rubber. 

The toner image that forms on the recording 
drum passes between the recording drum and 
transfer medium and is pressed against the 
transfer medium by a pressure roller. The 
adhesive force of the transfer medium, which 
exceeds the electrostatic force of the recording 
drum, lifts the toner image off the drum onto 
the transfer medium. 


2.2 Fusing transfer 

In the second transfer zone, the second 
backup roller presses the transfer medium into 
indirect contact with the heat roller, the 
intervening material being the paper } see 
Fig. 1-b) |. The toner image having been trans- 
ferred to the transfer medium and carried to 
the second transfer zone is now pressed between 
the transfer medium and paper, where the toner 
is fused by heat passing from the heat roller 
through the paper. The fusing toner is repelled 
from the transfer medium surface because of 
the medium’s low surface energy. The heated 
toner is simultaneously pressed into the paper 
fibers and fixed permanently on the paper 
surface. 


Recording drum fale image 


Transfer medium 


First transfer ————> Silicone rubber 


SSF 0.4mm 
=> =i mm 


Pressure roller 


Backup roller 


Polyester 


Heat roller 


Fig. 2—Experimental setup. 
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3. Experiments 
3.1 Setup 

Figure 2. shows the experimental setup. 
The transfer medium is a continuous belt of 
0.1 mm polyester film base coated with 0.4 mm 
of silicone rubber, providing a transfer medium 
with a smooth surface and low free surface 
energy (about 20 dyn/cm). Low electrical 
resistivity toner (10° Qcm) and plain paper 
(Xerox L Paper) are used. 

First, the toner image is formed on the 
recording drum. In the first transfer zone, the 
image is transferred from the recording drum to 
the transfer medium by an adjustable spring- 
driven pressure roller. The image is carried to 
the second transfer zone, where it is pressed 
between the transfer medium and paper, heated, 
and transferred to the paper. The heat is 
provided by an adjustable halogen lamp. 


3.2 Evaluation 

Basic transfer conditions were investigated 
by adjusting parameters such as rubber hardness, 
roller pressure, and the surface temperature 
of the heat roller. 

3.2.1 Transfer efficiency and fixing ratio 

Transfer efficiency refers to the relative 
amounts of toner on the drum, transfer medium 
and paper before and after a transfer. 

Efficiency for the first transfer, n,, and for 
the second, n,, are given as 

nm, = D,/Do x 100 (%), 

N2 D,/D, x 100 (%), 
where 

Do): Optical toner-image density on the 

recording drum, 


i 


a) Raggedness 


b) Blur 


Fig. 3—Edge structure of toner image. 
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D,: Optical toner-image density on the 

transfer medium, and 

D,: Optical toner-image density on paper. 

Adhesion of the fixed image to paper is 
measured by applying adhesive tape 
(3M No.810 tape) to the fixed image at a 
constant pressure with an iron roller and then 
peeling the tape off. 

Fixing ratio f is given as 

f = D;/D, x 100 (%), 
where 

D,: Optical toner-image density on paper 

after the tape adhesion test. 

3.2.2 Sharpness 

In electrophotography, image sharpness is 
judged according to the concepts of raggedness 
and blur” (see Fig. 3). 

Raggedness results when not enough toner is 
transferred to the image edges or when the toner 
is scattered during transfer. Scattered toner is 
spread and fused in “clumps” during image 
fixing. 

Blur is usually the result of toner being 
scattered during transfer. 
changes the area of the screen dots and has poor 
gray-scale reproducibility. The randomness of 
the scattering makes screen dots uneven. 


Scattered toner 


For raggedness, we evaluated the rising 
width of the characteristic curve of a line image 
by measuring the area rate of the toner image 
in the direction of line width. 


200 em 


-_ 


Mask 


Fig. 4—Scanning mask. 
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The toner area rate (Fig. 4) is measured 
using a 1.3 x 200 wm scanning mask in an image 
analyzer. The curve in Fig. 5 is the toner area 
rate for a position in the line image. The rising 
width, wo, is defined by 

Wo =(w, +w,)/2, 
where 

W,,W2: Distance over which the toner area 
rate goes from 10 percent to 
90 percent. 

For blur, we evaluated the amount of scatter 
in the area rate of screen dots before and after 
toner transfer” 


100 


90 


area rate ( 


Toner 


Position along the scan 


Fig. 5—Rising width w, and w2. 
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100 nm 
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-100 ~m 


Fig. 6—Screen dot measurement. 
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First transfer efficiency (%) 


Pressure : 0.5 kgf/cm 


0 20 30 4() 


Rubber hardness (Hs) 


Fig. 7—Efficiency of first transfer as a function of 
rubber hardness. 


One hundred screen dots of 100 x 100 um 
are separated by a 200 x 200 um grid (Fig. 6), 
and for each grid, the area rate, x;, is measured 
by an image analyzer (toner separated from dot 
images included). After calculating the average, 
x;, and the standard deviation of area rate data, 
On 1, Scatter for the area rate in screen dots is 
given by On-; /X;. 


4. Results and considerations 
4.1 First transfer 

Figure 7 shows the efficiency of the first 
transfer for different hardnesses of rubber at 
0.5 kgf/cm. Efficiency decreases gradually with 
increased rubber hardness because _ elastic 
deformation decreases with increased hardness 
at a constant pressure. With increased hardness, 
the surface of the transfer medium comes into 
increased contact with the toner. When rubber 
hardness is too low (<30 °), rubber strength, 
especially tear strength, decreases to where the 
rubber cannot be used as a transfer medium. 
We used a transfer medium having a rubber 
hardness of 30 °. 

Figure 8 shows the efficiency of the first 
transfer at different pressures. Efficiency in- 
creases up to a saturation level of 94 percent 
at a pressure of 0.6 kgf/cm. 
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Fig. 8—Efficiency of first transfer as a function of 
pressure. 
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Fig. 9—Characteristics of transfer and fixing as 
a function of pressure. 


4.2 Second transfer 

Figure 9 shows the efficiency of the second 
transfer and the fixing ratio at different 
pressures in the second transfer zone when the 
heat roller surface temperature is 200 °C. 
Efficiency increases with pressure, reaching 
100 percent at 1.0 kgf/cm. The fixing ratio 
increases steeply up to 98 percent at 0.4 kgf/cm, 
and reaches 100 percent at 1.0 kgf/cm. As the 
pressure increases, the nip between the backup 
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Fig. 10—Characteristics of transfer and fixing as 
a function of heat roller surface temperature. 


[____] : Electrostatic transfer 


: Nonelectrostatic transfer 


Rising width (”m) 


Transfer Fixing 


Development 


(First transfer) (Second transfer) 


Fig. 11 —Rising width of edge in each step of electro- 
photographic process. 


roller and paper also increases, lengthening the 
toner heating interval. This increases repulsion 
of the fused toner from the transfer medium 
surface and aides toner entry into the paper 
fibers. 

Figure 10 shows the efficiency of the second 
transfer and the fixing ratio at different heat 
roller surface temperatures under a constant 
pressure of 0.5 kgf/cm. Both the transfer 
efficiency and the fixing ratio increase with 
heat roller surface temperature. The fixing 
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: Electrostatic transfer 


: Nonelectrostatic transfer 


Scatter of the area rate (7%) 


Transfer Fixing 
(First transfer) (Second transfer) 


Development 


Fig. 12—Area rate scatter of screen dots in each step of 
electrophotographic process. 


ratio is particularly influenced by temperature. 


4.3 Image sharpness 

Based on the studies just discussed, we set 
up the following experiment. 

Figure 11 shows the rising width of the line 
image edge in electrostatic and nonelectrostatic 
transfer. In electrostatic transfer, the rising 
width in transfer step is 1.9 times that at 
development. Fixing does not increase this ratio. 
In nonelectrostatic transfer, the rising width 
in the first transfer step is 1.4 times that at 
development, with no increase in the second 
transfer step. 

Figure 12 shows scatter in the area rate 
for screen dots in electrostatic and nonelectro- 
static transfer. In electrostatic transfer, the area 
rate scatter in transfer is 6.6 times that at 
development. The rate after fixing is 8 times 
that at development. In _ nonelectrostatic 
transfer, the scatter rate in the first transfer 
step is 1.4 times that at development. In the 
second transfer step, the rate is 3.7 times that 
at development. The reason for this is that 
the surface roughness of plain paper prevents 
sufficient contact between the paper and the 
transfer medium, preventing complete transfer 
of the toner image. In an attempt to clarify 
this phenomenon, we used coated paper with 
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a) Electrostatic 


100 wm 


hes hay 


¥ 
: 


b) Nonelectrostatic 


Fig. 13—Samples of high-resolution image printing. 


a) Electrostatic 


(ee! 


Imm 


b) Nonelectrostatic 


Fig. 14—Printing samples when transferred toner has low electrical resistivity to plain paper. 


a smooth surface, and found that area rate 
scatter was limited to two times that at develop- 
ment. 

To summarize, the nonelectrostatic transfer 
sharper images 
than electrostatic transfer because less toner 
was scattered during the transfer steps. 


method we _ studied enabled 


4.4 Quality of transferred images 
Figure 13 shows examples of a high-resolu- 


tion image printing when using electrostatic 
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transfer a) and nonelectrostatic transfer b). 
The line image resolution is 1016 dots per inch 
(40 lines per millimeter). Clearly, for a toner 
image of 1000 dots per inch, the nonelectro- 
static transfer method much better than 
conventional electrostatic transfer. 

Figure 14 compares plain paper electrostatic 
transfer a) and 
printing samples when using toner having low 
electrical resistivity (10° Qcm). Nonelectrostatic 
transfer clearly produces a sharper toner image 


is 


nonelectrostatic transfer b) 
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on plain paper, even when low electrical resistivi- 
ty toner is used. 


5. Conclusion 

A new transfer method for electrophoto- 
graphic printers has been developed. Use of 
a transfer medium having a high affinity for 
toner and low free surface energy enables images 
to be transferred without electrostatic force. 

A toner image of 1000 dots per inch was 
transferred and was sharp on plain paper because 
of reduced toner scatter. Further, it was possible 
to transfer low electrical resistivity toner to 
plain paper because the electrical characteristics 
of the toner and the paper were independent. 
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Nonelectrostatic transfer produced high- 
quality image printing and is suitable for use in 
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Fe-doped InP, Gay Inq.,AsyP1.,, and Al, GayInj.4-,As grown by liquid phase epitaxy have 
been extensively studied. The temperature, composition, and material dependences of 
Fe doping characteristics of these materials are well explained by three factors: the Fe 
solubility in the growth solution, the background electron concentration, and the Fe 
distribution coefficient. The high resistivities of 8 x 107 ohm-cm, 2 x 10° ohm-cm, and 
1 x 10? ohm-cm have been obtained for InP grown at 900 °C, Gag2g!n974Aso57P0.43 and 


Alo.ag!nos2As grown at 750 °C, respectively. 


The activation energies of the Fe acceptor levels in Gay Inq. AsyPy and Al, GayInqy./As 
systems have been also studied. It is found that the Fe acceptor level is aligned at a constant 


energy relative to the vacuum level. 


1. Introduction 

Semi-insulating (SI) InP and ITII-V alloy semi- 
conductors lattice matched to InP are useful 
for high speed optical and electronic device 
applications. In particular, SI layers are superior 
to p-n junctions as current confining layers 
in Ga, Inj Asy P}4-InP buried heterostructure 
(BH) lasers, because they offer a significant 
reduction in both parasitic capacitance and 
leakage current!)»?), 

In general, SI IlI-V semiconductors are 
obtained by introducing transition metals which 
form deep levels in the forbidden gap. Fe is well 
known as a deep level impurity which provides 
SI properties to InP. Fe-doped SI InP has been 
obtained by the liquid encapsulated Czochraski 
(LEC) and the metal organic vapor phase 
epitaxy (MOVPE) ®. So far, however, it has 
been difficult to obtain the liquid phase 
epitaxial (LPE) layers of SI InP and other I[II-V 
compounds by Fe doping. This is mainly 
because the Fe solubility and distribution coef- 
ficient are very small at the growth temperature 
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usually used. 

Recently, the authors succeeded for the first 
time in achieving LPE growth of Fe-doped SI InP 
by using high growth temperatures above 
850°C”. Moreover, Fe-doped SI Ga, Inj.AsyP}- 
and Al,,Ga,In,.,-»As lattice matched to InP have 
also been obtained by LPE growth at the low 
growth temperature of 750 °C®”. These two 
alloy systems lattice matched to InP have almost 
the same energy gap range, which is very 
important for optical devices. 

The authors’ research into Fe doping 
characteristics of InP, Ga, Inj. As,P)., and 
Al, GayInj4.As is reported in this paper. 
The temperature and composition dependences 
of the Fe doping characteristics of the 
Ga, Inj. Asy P}4, system (including InP) and the 
difference in Fe doping characteristics between 
Gay Inj. As,P), and Al,Ga,Inj,.,As systems 
are mainly discussed. These characteristics are 
explained by three factors: the Fe solubility in 
the growth solution, the background electron 
concentration, and the Fe distribution coef- 
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ficient. The activation energies of the Fe 
acceptor levels in Ga,InjxAsyP), and 
Al,,Ga, Inj 4,» As systems have been also studied. 


2. Experiement 

Undoped or Fe-doped epitaxial layers were 
grown on Sn-doped n‘- or Fe-loped SI InP sub- 
strates. Sn- and Fe-doped layers were also 
grown on Zn-doped p*-InP substrates. InP and 
Ga, Inj x AsyP),, layers were grown on (100) 
oriented substrates and Al,Ga,Inj4.,As were 
grown on (111)B oriented substrates. The 
growth temperatures of InP were 650 °C, 
750°C, 800°C, 850°C, and 900°C. The 
Ga, Inj As,P,, and Al,Ga,Inj4.,As_ layers 
were grown at 750 °C. 

The lattice mismatch between the epitaxial 
layer and the substrate was less than | x 1073 
for In,Ga,xAsyP 3 and Al, GayInj4,.»As, as 
measured using double crystal X-ray diffraction. 

The energy gaps of Ga, In; As, Pj layers 
were 0.75 eV for GalnAs (y = 1), 0.98 eV for 
GalInAsP (y =0.57), and 1.24 eV for GaInAsP 
(vy =0.16), while those of Al,,Ga,In,.,.,As were 
0.80 eV for AlGaInAs (u=0.04), 0.95 eV for 
AlGalInAs (uw=0.16), 1.24eV for AlGalInAs 
(u=0.36), and 1.45 eV for AlInAs (u = 0.48). 
These energy gaps were measured using photo- 
luminescence. The arsenic compositions in 
Ga, Inj. As, P},, layers or the aluminum com- 
positions in  Al,Ga,Inj.,,As layers were 
determined using the composition dependence 
of the energy gap for each alloy system lattice 
matched to InP!® >!) 

Since the residual impurity in the epitaxial 
layers has to be reduced as possible in order to 
obtain the SI properties by Fe doping, the 
materials with the highest purity were carefully 
selected for the LPE growth. It was found that 
the background impurity density was mainly 
determined by the quality of indium. The im- 
purity level of indium was 1077 and that of 
iron was 10. An Alin mother alloy, which 
was composed of indium with an impurity level 
of 10~ and aluminum with the same impurity 
level, was used as the aluminum source!” 
The other source materials, InP, InAs, and GaAs 
were undoped crystals with a carrier concentra- 
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tion of less than 10!® cm-3. Specific purifica- 
tion techniques, such as baking for long periods 
of time'® , were not employed. 

The most serious problem in high tempera- 
ture growth is thermal damage to the InP sub- 
strates and InP epitaxial layers. To eliminate 
this problem for the InP substrate, the InP 
substrate was covered with an InP single crystal 
until the growth started and was etched in situ 
by undersaturated In-P solution prior to the 
growth. The as-grown InP epitaxial layer was 
covered by another InP single crystal immediate- 
ly after the growth. 

The electron concentrations and the resistivi- 
ties of the epitaxial layers were measured using 
Van der Pauw samples which were grown on 
SI substrates. When the resistivities exceeded 
1 x 105 ohm-cm, they were determined from 
the current-voltage characteristics of n* (cap 
layer)-SI(Fe-doped layer)-n*(substrate) structure 
mesa diodes!®. The samples for measurement 
were prepared as follows. Alloyed Au-Ge Ohmic 
contacts were formed on both sides of the 
wafers. Then mesa diodes with a diameter of 
200-800 um were fabricated by chemical 
etching. 

Secondary ion mass spectoscopy (SIMS) 
analyses were performed on undoped and Fe- 
doped samples to estimate the Fe concentrations 
in the epitaxial layers. 

Deep level transient spectroscopy (DLTS) 
measurements were performed on InP (y =0) 
and GalInAs (vy =1) layers using n (Sn- and Fe- 


doped layer)-p*(substrate) mesa diodes!®. 


3. Results 
3.1 Surface morphology 

Normarsky microphotographs of the as- 
grown surfaces of the undoped InP layer and 
0.4 wt% Fe-doped InP layer grown at 900 °C 
are shown in Fig. 1. Both as-grown surfaces 
were mirror-like. No significant degradation 
by high temperature growth and Fe doping 
was observed on the surface morphologies. 
Good surface morphologies were also obtained 
for Ga, Inj As, Pj, and Al, Ga, Inj .,.) As layers 
grown at 750°C. Figure 2 shows the surface 
morphologies of GaInAsP (y=0.57) and 
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———a 


a) Undoped InP 


b) 0.4 wt% Fe-doped InP 


Fig. 1—Surface morphologies of InP grown at 900 "es 


et 


a) GalnAsP (y = 0.57) 


b) AlInAs (u = 0.48) 


Fig. 2—Surface morphologies, grown at 750 Sra 


AllnAs (u = 0.48). 


3.2 Background electron concentration 

Figure 3. shows the growth temperature 
dependence of the background electron con- 
centration at 300 K and the electron mobility 
at 77 K in undoped InP. The electron concentra- 
tion was about 1-1.5 x 10!5 cm™? and was 
almost independent of the growth temperature. 

Figure 4 shows the electron concentration 
(300 K) and electron mobility (77K) of 
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undoped Ga, Inj.,AsyP,, grown at 750 "¢ 
versus the arsenic composition. The electron 
concentration in Ga,Inj.AsyP,, decreased 
gradually from about 1 x 10!5 cm~3 for InP 
(vy =0) to about 5x 1014 cm~? for GalnAs 
(y=1). The electron concentration (300 K) 
and electron mobility (77K) in undoped 
Al,,GayInj.,,As grown at 750°C versus the 
aluminum composition are shown in Fig. 5. 
The electron concentration in Al, Ga, Inj...) As 
was about 1.2 x 1015 cm~? for all the composi- 
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Fig. 3—Growth/temperature dependence of electron 
concentration at 300 K and electron mobility at 
77 K in undoped InP. 
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Fig. 4—Composition dependence of electron 
concentration at 300 K and electron mobility 
at 77 K in undoped Gay, Inj_x Asy P}-y. 


tions. Comparing the results of Figs. 4 and 5, 
the electron concentration in Al,,Ga,Inj.,.)As is 
greater than or equal to that in Ga, Inq x As, P}-y. 
As shown in Figs.3, 4, and 5, high purity 
epitaxial layers of InP, Ga, Inj. As) P)., and 
Al,,Ga, Inj.» As were obtained without specific 
purification techniques. 


3.3 Resistivity 


Figure 6 shows the resistivity of Fe-doped 
InP versus the weight percent of Fe in the 
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Fig. 5S—Composition dependence of electron 
concentration at 300 K and electron mobility 
at 77 K in undoped Al,,Gay, Inq_,-» As. 
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Fig. 6—Resistivity of Fe-doped InP versus the wt% of Fe 
in the growth temperature. The arrow shows the 
Fe solubility in an indium solution at each 
temperature!® (see subsection 4.1.1). 


growth solution at the growth temperatures 
of 800 °C, 850 °C, and 900 °C. At all growth 
temperatures, the resistivity increased as the 
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a 
GalnAsP (y = 0.57) 
a 


GalInAs (y = 1) a 


“ 


Resistivity (ohm+cm) 


10-3 10-2 107) l 
Weight percent of Fe 


Fig. 7—Resistivity of Fe-doped Gay Inj., AsyPj-) versus 
the wt% of Fe in the growth solution. The arrow 
shows the Fe solubility in an indium solution 
at 750 °c!® (see subsection 4.1.2). 


weight percent of Fe increased. The resistivities 
increased abruptly at the critical weight percent 
of Fe at 850°C and 900 °C. The Fe critical 
amount at 900 °C was less than that at 850 °C. 
The resistivities saturated at 5 x 10° ohm-cm 
and 5x 107 ohm-cm at 850°C and 900 °C, 
respectively. The maximum resistivity was 
8 x 107 ohm-cm. This is the highest resistivity 
for InP obtained by LPE to date and is com- 
parable to the resistivity of Fe-doped SI InP 
grown by MOCVD. All the layers were conduc- 
tive at the growth temperature of 800 °C. 

The resistivities of Fe-doped Ga, Inj. As, P},, 
with three compositions (including InP) grown 
at 750 °C versus the weight percent of Fe in the 
growth solution are shown in Fig. 7. GalnAsP 
(y=0.57) and GalInAs (y=1) were semi- 
insulating at the growth temperature of 750 °C, 
while InP (y=0) was conductive. The maxi- 
mum resistivity was about 2 x 105 ohm-cm for 
GalnAsP (y =0.57), and about 2 x 10? ohm-cm 
for GaInAs (y =1). These resistivities were near 
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Fig. 8—Resistivity of Fe-doped Al,,Ga,Inj_,.)As versus 
the wt% of Fe. 
The arrow shows the Fe solubility in an indium 
solution at 750 °C!® (see subsection 4.1.3). 


the maximum theoretical values calculated 
from the energy gaps. The critical weight 
percent of Fe where the GalnAs (y = 1) resistivi- 
ty abruptly increased was more than one order 
of magnitude less than that for GalnAsP 
(y =0.57). 

The resistivities of Fe-doped Al, Ga, Inj .,., As 
with three compositions versus the weight 
percent of Fe in the growth solution are shown 
in Fig. 8. In this figure, the results of GalnAsP 
with the arsenic content of 0.57 (from Fig. 7) 
are also shown. The resistivity increased abrupt- 
ly at almost the same weight percent of Fe, 
and the layers were semi-insulating. The maxi- 
mum resistivity for AllnAs (u = 0.48) with the 
largest energy gap of 1.45 eV in this system 
was about 1 x 10° ohm-cm. This is the highest 
resistivity obtained so far for SI alloys lattice 
matched to InP. 

AlGaInAs (u = 0.16) and GaInAsP ( y = 0.57) 
in Fig.8 have almost the same energy gap. 
Comparing Fe doping characteristics between 
these two alloys, two important points are 
illustrated. The first is that the maximum 
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Fig. 9-Maximum resistivity versus the energy gap of the 
layer for Fe-doped (©) Gay Inj-, Asy Pj.) and (¢) 
Al, Gay Inq_y-y As. 


Current density (A/cm?) 


10-2 107! ] 10 10? 
Voltage (V) 


Fig. 10—Typical current density-voltage characteristics 
of n*-Si-n* diodes at 295 K for Fe-doped InP 
(y = 0) and GaInAsP (y = 0.57). 
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resistivity of both alloys are almost the same. 
This resistivity is near the maximum theoretical 
value calculated from the energy gap. The 
second is that the critical weight percent of Fe 
where the AlGaInAs (u=0.16)_ resistivity 
increased is about one order of magnitude less 
than that for GaInAsP ( y = 0.57). 

Figure 9 shows the maximum resistivities 
obtained for Fe-doped Ga,Inj~ As, Pj.) and 
Al, GayInj_y,As at the growth temperature 
of 750°C versus the energy gap of the layer. 
The results of Fe-doped GalInAsP (vy =0.16) 
and AlGaInAs (u=0.36) are added to this 
figure. 

Typical current density (/)-voltage (v ) 
characteristics of n*-SI-n* diodes for InP (y = 0) 
and GalnAsP (vy =0.57) are shown in Fig. 10. 
When low voltages were applied, the j-y curves 
obeyed Ohm’s law. Above the Ohmic region, 
the current increased rapidly above the critical 
voltage!” . 

Temperature dependences of resistivity for 


Resistivity (ohm+cm) 


eo |. 
GalnAsP 
(v = 0.57) 


108/ T (K~!) 


Fig. 11—Temperature dependence of the resistivity for 
Fe-doped SI InP (y = 0) and GaInAsP 
(y = 0.57). 
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Table 1. Results of the SIMS analysis 


1 MY 
. Wke XFe 

Material Tgrowth (wt.%) tent ) 
800 0.1 1.5 x 10!5 
850 0.1 4x10!5 
InP 900 0.01 eo" 
900 0.1 7x 10° 
900 1 8 x 101° 
GalnAsP (y=0.57) | 750 0.1 25:% 00"° 
GalnAs (y=1) 750 0.1 3 x 101° 
750 0.01 2 x 1015 
AlGaInAs (u = 0.16) ‘3 

750 0.05 9x10 
AlGaInAs (u = 0.04) | 750 0.03 8 x 10!5 
AlGalInAs (u = 0.36) | 750 0.03 6 x 10!5 
AllInAs (uw=0.48) | 750 0.03 5x 10!5 


SI InP (y=0) and GalnAsP (y=0.57) are 
shown in Fig. 11. This resistivity was near 
the maximum resistivity for each material. 
The activation energy of resistivity for InP 
(vy =0) and GalInAsP (vy =0.57) were 0.62 eV 
and 0.47 eV, respectively, whereas the activation 
energies of resistivity for SI AlGalInAs with 
an aluminum content of 0.04, 0.16, and 0.36 
were 0.35 eV, 0.52 eV, and 0.78 eV, respectively. 


3.4 Fe distribution coefficient 

The results of SIMS analysis on the 
Fe-doped InP, Ga, In} x Asy P1-y, and 
Al,,Ga,Inj.y.»As are shown in Table 1. The Fe 
concentration in the InP layer increased with 
the growth temperature or the Fe weight 
percent in the low doping range, but saturated 
at 1 wt% Fe in the solution. The Fe concentra- 
tions in Ga, Inj.,AsyP)., and Al, Ga,Inj-y.)As 
varied with the composition. 

The Fe distribution coefficients could be 
estimated from the results of SIMS analysis 
in the low doping range, because the Fe con- 
centration obtained from SIMS analysis was 
nearly proportional to the Fe content in the 
growth solution. Figure 12 shows the Fe distri- 
bution coefficient for InP versus the reciprocal 
of the growth temperature. The distribution 
coefficient for InP was about 2x10, 
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Fig. 12—Temperature dependence of the Fe distribution 
coefficient for InP. 
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Fig. 13—Composition dependence of the Fe distribution 
coefficient for Gay Inj_y Asy P}-y. 


5x10. and 1x10 a 200°C. $507°¢C, 
and 900°C, respectively. The distribution 
coefficient at 1070°C, predicted by ex- 
trapolating the line in Fig. 12, agrees well with 
that reported for LEC grown Fe-doped InP 
crystals!®) : 

Figure 13. shows the composition de- 
pendence of the distribution coefficient for the 


Ga, Inj, As,Pj., system (including InP) at 
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Fig. 14—Composition dependence of the Fe distribution 
coefficient for Al,,Ga, Inq_y.)As. 
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Fig. 15—Typical DLTS spectra of n-type Fe-doped InP 
(y = 0) and GalnAs (y = 1). 


750 °C. The distribution coefficient was about 
1x10 for InP (y =0), about 3 x 10 for 
GaInAsP (vy =0.57), and about 4x 10~ for 
GalInAs (y=1). In particular, the distribution 
coefficient increases about 40 times as the 
composition varies from InP (vy =Q) to GaInAs 
(y= 1). 

The Fe distribution coefficient for the 
Al,,Ga,Inj.,.)»As system at 750°C versus the 
composition are shown in Fig. 14. The distribu- 
tion coefficient was about 2 to 4x 10%, and 
was almost the same for all the composition. 
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Fig. 16—Plots of In (T? /e, ) versus 1/T. 


3.5 Activation energy of Fe acceptor level 

Figure 15 shows a typical DLTS spectra of 
Fe-doped InP (y=0) and GalInAs (y=1). 
Capacitance transients caused by electron 
emission from an impurity level are observed. 
Since these peaks are not observed in undoped 
InP (vy =0) and GalnAs (y = 1), this impurity 
level can be assigned to the Fe acceptor level. 
The electron emission rate e, was measured 
as a function of temperature in order to obtain 
the activation energies of this level. Figure 16 
shows the plots of In (T?/e, ) versus 1/7. From 
the results of this figure, the activation energy 
was 0.63 eV in InP (y=0) and 0.37 eV in 
GalnAs (y = 1)". 


4. Discussion 
4.1 Fe doping characteristics 

There are three factors in determining the 
Fe doping characteristics in LPE: the Fe solu- 
bility in the growth solution, the background 
electron concentration, and the Fe distribution 
coefficient. Considering these factors, we shall 
discuss the temperature dependence of Fe 
doping characteristics of InP, the composition 
dependence of Fe doping characteristics of 
the Ga,In;.,Asy)P}., system, and the differ- 
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ence of Fe doping characteristics between the 
Ga, Inj.. As, Py.) and Al, Gay Inj.4-»As system. 

4.1.1 Temperature dependence of Fe 
doping characteristics of InP 

Two important points are illustrated in the 
results of the resistivities of Fe-doped InP shown 
in Fig. 6. The first is that the maximum resistivi- 
ty at each growth temperature increase with 
the growth temperature. The second is that the 
critical weight percent of Fe where the resistivi- 
ty of the layer abruptly increased decreases with 
the growth temperature. 

The first point can be explained by the 
temperature dependence of the Fe solubility in 
the growth solution. The Fe solubility in an 
indium solution at each temperature!°? is shown 
with an arrow in Fig. 6. Each value agrees well 
with the weight percent of Fe where the resis- 
tivity is saturated. This result shows that the Fe 
solubility in the growth solution for InP is al- 
most the same as that in an indium solution. 
Therefore, the temperature dependence of the 
maximum resistivity is caused by the tempera- 
ture dependence of the Fe solubility in the 
solution. 

At the critical weight percent of Fe, the 
background electron concentration in the epi- 
taxial layer is nearly equal to the concentration 
of Fe deep acceptor. Therefore, the second 
point is due to the strong positive temperature 
dependence of Fe _ distribution coefficient, 
considering that the background electron 
concentration is almost independent of the 
growth temperature as shown in Fig. 3. It is 
concluded that the temperature dependence of 
Fe doping characteristics of InP is due to the 
temperature dependence of Fe solubility and 
distribution coefficient. 

4.1.2 Composition dependence of Fe doping 
characteristics of the Ga, Inj. As, P}_, 
system 

It is clear from Fig. 7 that SI Ga, Inj. Asy P}., 
is easier to obtain as the composition varies from 
InP (v =0) to GaInAs (y = 1). 

The Fe solubility in an indium solution at 
750 °C!®) is shown with an arrow in Fig. 7. This 
value agrees well with the weight percent of Fe 
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where the resistivities of GalnAsP (vy =0.57) 
and InP (y =O) became saturated. This means 
that the Fe solubility in the growth solution for 
Ga, Inj., As, P;.) is the same as that in an indium 
solution and independent of the composition. 

The background electron concentration de- 
creased gradually from InP (y =0) to GalnAs 
(vy = 1) as shown in Fig. 4, and the Fe distribu- 
tion coefficient increased greatly from InP 
(vy =0) to GalInAs (y = 1) as shown in Fig. 12. 
Therefore, the composition dependence of Fe 
doping characteristics of the Ga, Inj., As, P}.) 
system is caused by the composition dependence 
of background electron concentration and Fe 
distribution coefficient. In particular, the Fe 
distribution coefficient is a dominant factor. 

4.1.3 Comparison of Fe doping character- 

istics between Ga,Inj.,Asy Pj.) and 
Al,, Ga, Inj.» As systems 

Ga, Iny.,As,P}., and Al,Ga,Inj.,.,As sys- 
tems lattice matched to InP have almost the 
same energy gap range. It is significant for 
application to compare the Fe doping character- 
istics between these two alloy systems. From 
Figs. 8 and 9, it is found that SI Al,,Ga,In,.,., As 
is easier to obtain than SI Ga, Inj. Asy P}_,. 

The Fe solubility in an indium solution at 
750 °C!® is shown with an arrow in Fig. 8. It 
is the same as that in the growth solution for 
Ga, Inj. Asy Pj.) , as discussed in Subsec. 4.1.2. 
SI Al,,Ga,Inj_,.,As can be obtained at the 
smaller weight percent of Fe than the Fe solubili- 
ty in the growth solution for Gay Inj., As, P}.). 
Therefore, the Fe solubility in the solution does 
not cause the difference in the doping character- 
istics. 

The background electron concentration in 
Al, GayInjy.,As is greater than or equal to that 
in Ga, Inj., As, P|.) , as shown in Figs. 4 and 5. 
Because of this, the background electron con- 
centration does not cause Al, Ga, Inj.y-)As 
to become semi-insulating more easily than 
Gay Inj., Asy Pj). 

Figure 17 shows the Fe distribution co- 
efficient versus the energy gap of the layer for 
Ga, Inj.,AsyP}.» and Al,GayInj.,.,As. It is 
clear that the Fe distribution cofficient of 
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Fig. 17—Fe distribution coefficient versus the energy gap 
of the layer for Ga, Inj. Asy P}-) and 
Al, Gay Inj_y-yAs. 


Al,,GayInj.,-»As is much greater than that of 
Ga, In,.,As,Pj). at the same energy gap. For 
example, the Fe distribution coefficient of 
AlGaInAs (uw=0.16) with an energy gap of 
0.95 eV is about 3 x 107 as shown in Fig. 12, 
while that of GalnAsP (y = 0.57) with the same 
energy gap is about 4 x 107° as shown in Fig. 13. 
Although the background electron concentra- 
tion of AlGalInAs (u = 0.16) with this energy gap 
is twice as great as that of GalnAsP (y =0.57) 
from the results shown in Figs. 4 and 5, the high 
distribution coefficient of AlGaInAs (u = 0.16) 
can make up for the disadvantage of the back- 
ground level. As a result, AlGaInAs (u =0.16) 
can be made semi-insulating for a smaller weight 
percent of Fe in the growth solution than that 
for GaInAsP (y = 0.57), as shown in Fig. 8. It is 
concluded that SI Al,,Ga,Inj.,.,As is easier to 
obtain than SI Ga, Inj., As, P}.) , because of the 
high Fe distribution coefficient. 
4.2 Fe acceptor level in Ga,Inj.,Asy Pj.) and 

Al,, Ga, Inj_,.)»AS systems 

The activation energies of the Fe acceptor 
level were 0.63 eV in InP (y =O) and 0.37 eV 
in GalInAs (y = 1), from the results of the DLTS 
measurements in section 3.5. 

The activation energies of resistivities 
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Fig. 18—Activation energy of Fe accepted level in 
Gay Inj. Asy P}-y. 
The band diagram is described by the results of 
references 10 and 19. 


for Fe-doped SI InP, Ga, In,., As, Pj), and 
Al,,GayInj_,.»AS were obtained in Sec. 3.3. 
These activation energies show the locations of 
the Fermi levels in the forbidden gaps of the 
materials. Since the temperature dependence of 
the resistivity was measured for the SI layer 
whose resistivity was near the maximum resis- 
tivity, it is considered that the Fe concentration 
in the layer was larger than the background 
electron concentration. Therefore, the Fermi 
level lies almost at the Fe acceptor level. Namely, 
the activiation evergy of the resistivity is almost 
the same as that of the Fe acceptor level. The 
activation energy of 0.62 eV for InP obtained 
from the temperature dependence of resistivity 
agrees well with the value obtained by DLTS 
measurement. Figures 18 and 19 show the 
activation energies of the Fe acceptor level in 
Ga, Inj., As, Pj.) and Al,Ga,Inj.,.»As, respec- 
tively. The band diagram of Ga, Inj.,Asy Pj-) 
shown in Fig. 18 is described by using the com- 
position dependence of energy gap reported by 
R.E. Nahory et al!®°), and the band-edge dis- 
continuity measured by S.R. Forrest et all!) 
The band diagram of Al,,Ga,Inj.y.)As in Fig. 19 
is described by using the results of references 11 
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Fig. 19—Activation energy of Fe acceptor level in 
Al,, Gay Inq-y-y As. 
The band diagram is described by the results of 
references 11 and 20. 


and 20. 

The activation energy of Fe acceptor level in 
Gax Inj., As) P}.,, decreased as the arsenic con- 
tent increased and that in Al,,Ga,In,.,.)As in- 
creased as the aluminum content increased. 
An important point is that, for both systems, 
the Fe acceptor level is aligned at a constant 
energy relative to the vacuum level, though the 
activation energy depends on the composition. 
This agrees well with the vacuum-referred bind- 
ing energy (VRBE) model*’?. 


5. Conclusion 

Fe-doped InP, Ga, Inj, As, P}.), and Al, Ga, 
Inj.,-»As grown by liquid phase epitaxy were 
extensively studied. The temperature, composi- 
tion, and material dependence of Fe doping 
characteristics of these materials are well ex- 
plained by three factors: the Fe solubility in the 
growth solution, the background electron con- 
centration, and the Fe distribution coefficient. 
The high resistivities of 8 x 107 ohm-cm, 2 x 
105 ohm-cm, and | x 10? ohm-cm were obtained 
for InP grown at 900 °C, Gao.26 Ino,74 Aso.57Po.43 
and Alg.4g Ing.52 As grown at 750 °C, respectively. 

The activation energies of the Fe acceptor 
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levels in Ga, Inj., AsyP}., and Al, Gay Inj-y-» As 
systems were also studied. It was found that the 
Fe acceptor level is aligned at a constant energy 
relative to the vacuum level for both alloy 
systems. 
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for Magnetic Circuit Yoke 
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To produce a soft magnetic Fe-50%Co alloy using powder metallurgy, a study was made 
on the effects of various starting powders on the sintered density and magnetic properties 
of this alloy. An Fe-50%Co sintered alloy with a relative density of 95 percent and a mag- 
netization of 2.15 7 ina magnetic field of 4 kA/m was obtained using pre-alloyed Fe-20%Co 


and —400 mesh pure Co powders as the starting powders. This sintered alloy was used in 
a 24-wire-dot matrix printer. This printer operated at a printing speed of 110 cps (for Japa- 
nese character ‘‘kanji’’ printing) due to the higher magnetization afforded by the Fe-50%Co 


sintered alloy. 


1. Introduction 

In 1926, Elmen! invented the Fe-50%Co 
(all percentages in this paper are based on mass) 
“Permendur” alloy. This alloy has the highest 
saturation magnetization of all ferromagnetic 
materials. By using this alloy, it was possible to 
develop higher performance and smaller size 
electro-magnetic components such as magnetic 
circuit cores and yokes. 

However, the fabrication of Fe-50%Co 
alloy into various forms was very difficult 
because of its brittleness. To make this material 
less brittle, White and Wahl” introduced the 
2%V-49%Fe-Co alloy in 1932. Nearly fifty years 
have passed since its introduction, and _ its 
application is still limited to a few special cases. 
This may be attributed to the low workability 
due to the material’s inherent brittleness and the 
high cost of cobalt. 

Powder metallurgy is suitable for materials 
of low workability, such as Fe-50%Co alloy, 
to produce near-net shape parts. However, 
very little work has been done in the field 
of Fe-Co sintered alloys. This is probably 
due to the difficulty in obtaining a high density 
sintered alloy using a mixture of pure Fe and 
Co powders. The reason for this difficulty is that 
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the difference in Fe and Co diffusion coeffi- 
cients causes the formation of Kirkendall voids 
during sintering. 

This paper presents the results of our 
research on the sintering behavior of Fe-50%Co 
alloy based on our investigation of the sintered 
density of samples that use pre-alloyed Fe-Co 
powders of various Co content, and Co or Fe 
powder of different particle sizes. This paper 
also describes the magnetic properties and 
microstructures of the resulting Fe-50%Co 
sintered alloys, and the application of this 
sintered alloy for the magnetic circuit yoke of 
the print head used in a 24-wire-dot matrix 
printer. 


2. Experimental procedure 
Table 1 lists the starting powders. Water- 


Table 1. Starting powders 


Starting Retrication Particle size 
powders (mesh) 
Pre-alloyed : 
Fe-x%Co* Water-atomized —325 
Fe Electrolytic —325, —400 
Co Reduced —325, —400 
*x = 10-90 
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atomized —325 mesh pre-alloyed Fe-x%Co 
powders (x=10-90), reduced Co powder 
(—325 and —400 mesh), and an electrolytic 
Fe powder (—325 and —400 mesh) were used. 
These powders were mixed to obtain a composi- 
tion of Fe-50%Co. For example, when the pre- 
alloyed Fe-20%Co powder was used, pure Co 
powder was added to obtain a composition of 
Fe-50%Co. The mixture of Fe-50%Co composi- 
tion was compacted into a 45/35 mm by 7 mm 
thick ring under a compacting pressure of 
390 MPa. The samples were then presintered 
and compacted again. Finally, these samples 
were sintered for 1 h at 1 400 °C ina dry hydro- 
gen atmosphere. 

The green density was calculated from the 
size and weight of the compacted ring samples. 
The sintered density was measured according 
to the Archimedes method. The relative density 
was calculated as the ratio of green and sintered 
density to the theoretical density”. 

Ring sample magnetic properties, such as 
magnetization (B4,), coercive force (H,) and 
maximum permeability (um) were measured 
using a DC magnetic hysteresis loop tracer 
ina magnetic field of 4 kA/m. 

The microstructures were studied using an 
optical microscope and an electron’ probe 
microanalyzer. 

Finally, the sintered alloy was applied to 
a magnetic circuit yoke of a print head in 
a 24-wire-dot matrix printer. The printing 
force and printing speed were then studied. 


300 
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Fe-x%Co alloy composition 


Fig. 1—Hardness of Fe-Co binary alloy. 
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3. Results and discussion 
3.1 Sintering behavior 

Figure | shows the hardness of the Fe-Co 
binary alloy. Maximum hardness is obtained 
with a Co content of 50 percent. This may 
be due to the ordering in this alloy”. 

Figure 2 shows the effect of the Co composi- 
tion (x) of pre-alloyed Fe-x%Co powders on 
the green densities for the Fe-50%Co compacted 
samples. In this figure, x =O is for the sample 
that uses a mixture of pure Fe and Co powders. 
The value at x = 50 is for the sample that uses 
pre-alloyed Fe-50%Co powder only. However, 
this pre-alloyed powder was too hard (as shown 
in Fig. 1) to be compacted. The green densities 
increase as the content of pure Fe or Co 
powders increases. In other words, the green 
densities are affected by both the hardness 
of the pre-alloyed Fe-Co powder and the 
content of pure Fe or Co powder in the mix- 
tures. Conversely, the green densities are also 
affected by Fe and Co powder particle size. 
This is probably because finer Fe and Co 
powders are more suitable for packing with the 
coarser pre-alloyed Fe-x%Co powders. 

Figure 3 shows the effect of the Co composi- 
tion (x) of pre-alloyed Fe-x%Co powder on the 
sintered densities for the Fe-50%Co sintered 
alloy. In this figure, x =O and x =50 are the 


Pre-alloyed powder 
and Co powder 


Pre-alloyed powder 
and Fe powder 


Particle size of Co powder 
Particle size of Fe powder 


—400 mesh 


o—_e_ 
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Green density (percent) 


—325 mesh 


Not compacted 


0 20 4() 60 80 100 


Co composition (x) of pre-alloyed Fe-x%Co powder 


Fig. 2—Effect of Co composition (x) of pre-alloyed 
Fe-x%Co powder on green densities for 
Fe-50%Co compacted samples. 
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same as in Fig.2. The sintered densities are 
greatly affected by the Fe and Co powder 
particle size. This is probably because the finer 
Fe and Co powderes are more active during 
sintering. However, a poor correlation between 
the green densities in Fig. 2 and sintered densi- 
ties in Fig. 3 is observed. The reason for this is 


Pre-alloyed powder 


Pre-alloyed powder 
and Co powder 


and Fe powder 


Particle size of Co powder 


Particle size of Fe powder 
—400 mesh 


er 


—325 mesh 


Sintered density (Percent) 


0 20 40 60 80 100 


Co composition (x) of pre-alloyed Fe-x%Co powder 


Fig. 3—Effect of Co composition (x) of pre-alloyed 
Fe-x%Co powder on sintered densities for 
Fe-50%Co sintered alloy. 


a) (Fe-20%Co) + Co 


assumed to be that the interdiffusion between 
the pre-alloyed Fe-xx%Co and Co powders de- 
pends on the crystal structure of pre-alloyed Fe- 
x%Co powder. In other words, the anomalous 
behavior (for x = 30 and x = 70 in Fig. 3) corre- 
sponds to the formation of an ordered structure 
based on the Fe3;Co and FeCo3 composition. It 
is currently difficult to account for these experi- 
mental results. Further detailed experiments on 
this point should be conducted. The maximum 
value of the sintered densities is found in the 
sample that uses the pre-alloyed Fe-20%Co and 
—400 mesh Co powders. Generally, the magnetic 
properties of sintered magnetic materials, By, in 
particular, depend on the sintered density. There- 
fore, this discussion focuses on a comparison 
between the sample that uses _ pre-alloyed 
Fe-20%Co and —400 mesh Co powders and the 
sample that uses —400 mesh pure Fe and Co 
powders. 

Figure 4 shows cross-sectional views of the 
samples that use: 

a) Pre-alloyed Fe-20%Co and —400 mesh 

Co powders, and 
b) —400 mesh pure Fe and Co powders. 


‘comsersmmvemressccrmscuenes 
100 wm 


b) Fe +Co 


Fig. 4—Microstructures. 
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The pore size and pore densities in a) are 
smaller and fewer than those in b). This suggests 
that the sintering process for each sample is 
different. 

Figure 5 shows the relationship between 
the sintering temperature and sintered density 
of the sample that uses a mixture of pre-alloyed 
Fe-20%Co and -—400 mesh Co powders, as 
compared to that of the sample that uses a 
mixture of —400 mesh pure Fe and Co powders. 
In the sample that uses the pre-alloyed powder, 
the sintered density increases abruptly at 
temperatures between 400°C and 600°C. In 
comparison, similar behavior is not observed 
in the sample that uses a mixture of pure Fe 


100 


(Fe-20%Co) + Co 


x 


Sintered density (percent) 


0) 300 600 900 1 200 1500 
Sintering temperature (°C) 


Fig. 5—Relationship between sintering temperature and 
sintered density. 


and Co powders. 

Figure 6 shows photomicrographs of the 
grain boundaries of the pre-alloyed Fe-20%Co 
powder and ~—400 mesh pure Fe_ powder. 
Note that the pre-alloyed Fe-20%Co powder 
has more grain boundaries than the pure Fe 
powder. Grain boundaries are generally thought 
to promote the elimination of pores and 
Kirkendall voids. This may affect the sintering 
of the sample that uses a mixture of pre-alloyed 
Fe-20%Co and —400 mesh Co powders in the 
early stages at temperatures between 400 °C and 
600 °C. Therefore, the sintered density of the 
sample that uses a mixture of pre-alloyed 
Fe-20%Co and —400 mesh Co powders is higher 
than that of the sample that uses —400 mesh 
pure Fe and Co powders. 


3.2 Sintered densities and magnetic properties 
Figure 7 shows the effect of the Co composi- 
tion (x) of the pre-alloyed Fe-x%Co powder 
on the magnetic properties of the Fe-50%Co 
sintered alloy that uses a mixture of pre-alloyed 
Fe-Co powder and —400 mesh Fe or Co powder. 
In this figure, x =O and x = 50 are the same as 
in Fig. 2. B4, reaches a maximum of 2.15 T for 
the sample that uses pre-alloyed Fe-20%Co and 
—400 mesh Co powders. This corresponds to 
the behavior of the sintered density shown in 
Fig. 3. Moreover, this sample has the optimal H, 


a) Pre-alloyed Fe-20%Co powder 


ae 
20 wm 


b) Pure Fe powder 


Fig. 6—Microstructures of pre-alloyed Fe-20%Co powder and pure Fe powder. 
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and wm values of all samples. 

Figure 8 shows the relationship between 
the sintered density and B4, for the sample 
that uses a mixture of pre-alloyed Fe-20%Co 


Lm (mH/m) 


By, (T) 


H, (A/m) 


0 20 40) 60 80 100 
Co composition (x) of pre-alloyed Fe-x%sCo powder 


Fig. 7—Effect of Co composition (x) of pre-alloyed 
Fe-x%Co powder on magnetic properties for 
Fe-50%Co sintered alloy. 


and —400 mesh Co powders, as compared 
to that of the sample that uses a mixture of 
—400 mesh pure Fe and Co powders. The 
sample that uses the pure powders has a By, of 
167 at a sintered density of 90 percent. 
Theoretically, if the sintered density of this 
sample were 95 percent, its B4, would be 
1.68 T. For comparison, the sample that uses 
the pre-alloyed powder has a By, of 2.15 T at 
a sintered density of 95 percent. This differ- 
ence suggests that factors other than sintered 


(Fe-20%Co) + Co ee; 


Influence of 
other factors 


Calculated value 


90) 95 100 


Sintered density (percent) 


Fig. 8—Relationship between sintered density and Ba,. 
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Fig. 9—Concentration maps. 
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a) Magnetic yoke 


b) Print head 


Fig. 10—Print head yoke and head. 


density affect magnetization. These factors, 
apart from sintered density, may include the 
degree of uniform Fe-50%Co alloy composition, 
order-disorder transformations, and a residual 
Y phase°?. Here, we will discuss the uniformity 
of the alloy composition based on the results of 
electron microprobe analysis. 

Figure 9 shows the concentration maps of 
the characteristic X-ray of Fe for the samples 
in Fig. 8. In this figure, the green region cor- 
responds to the composition near Fe-50%Co, 
the yellow region corresponds to the Fe-rich 
parts, and the blue region corresponds to the 
Fe-poor parts and pores. The alloy composition 
of the sample that uses pre-alloyed powder 
is much more uniform than that of the sample 
that uses pure powders. This is the main reason 
why the sample that uses pre-alloyed Fe-20%Co 
and —400 mesh Co powder has the highest 
magnetization of all samples. Consequently, 
it was found that the pre-alloyed powder tech- 
nique is an effective means of obtaining good 
uniformity in alloy composition. However, 
such good uniformity in alloy composition 
does not account for all of the high magnetiza- 
tion of this sintered alloy. Further investigation 
into the order-disorder transformations and 
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Fig. 11—Print head scheme. 


behavior of the y phase in this sintered alloy 
should be conducted. 


3.3 Application in a print head 

This sintered alloy was used in the magnetic 
circuit yoke of the print head of a 24-wire-dot 
matrix printer. Figure 10 shows the part that 
uses the sintered Fe-50%Co alloy, and also 
shows the print head. 
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Figure 11 shows the arrangement of the 
print head of the 24-wire-dot matrix printer. 
The print-wire is fixed to an armature and the 
spring system is normally retracted by a 
magnetic attractive force between the armature 
and core. This magnetic attractive force is 
generated by the magnetic field (indicated by 
the dashed line), of the permanent magnet. The 
field flows through the core and yoke. This 
magnetic attractive force holds the wire back. 
When the opposing magnetic field (indicated 
by the bold line) is induced by the coil, the 
energy stored in the retracted spring causes the 
wire to shoot forward. Accordingly, if a higher 
magnetic field is possible, a stronger spring can 
be used. This will result in a higher printing 
speed. 

Figure 12 shows the magnetic attractive 
force versus the wire stroke of the print head 
that uses the Fe-50%Co sintered alloy, as 
compared to that of the Fe-3%Si sintered alloy. 
The Fe-3%Si alloy is normally used for mag- 
netic circuit yoke and cores. The Fe-3%Si 
sintered alloy used in this study has a By, of 
1.67, H, of 35 A/m, and yu, of 22.5 mH/m. 
For each wire stroke, the magnetic attractive 
force of the print head that uses Fe-50%Co 
sintered alloy is larger than that of the print 
head that uses Fe-3%Si sintered alloy. This 
is due to the higher magnetization of the 


Magnetic attractive force (NV) 


0 0.2 0.4 0.6 0.8 
Wire stroke (mm) 


Fig. 12—Magnetic attractive force vs. wire stroke. 
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Fe-50%Co sintered alloy. 

As a result, the printer was able to print at 
a printing speed of 110 cps for ‘‘kanji” 
character printing, and 330 cps for alphanumeric 
character printing. These are the fastest printing 
speeds to date for a 24-wire-dot matrix printer. 


4. Conclusion 
We have developed an Fe-50%Co alloy 

by using powder metallurgy. Because this 

sintered alloy has high density and good 
uniformity of alloy composition, it also has 
good magnetic properties. 

The results obtained can be summarized 
as follows: 

1) A mixture of pre-alloyed Fe-20%Co and fine 
Co powder is very effective for fabricating 
Fe-50%Co alloys by powder metallurgy. This 
is partly due to the lower hardness of 
Fe-20%Co powder when compared to other 
Fe-x%Co powders, which makes it suitable 
for packing: and partly due to the higher 
grain boundary density of pre-alloyed pow- 
der, which accelerates sintering and promotes 
the elimination of pores and Kirkendall 
voids. The fine Co powder is also more 
active during sintering. The pre-alloyed pow- 
der technique plays an important role in 
obtaining good uniformity of alloy composi- 
tion. This sintered alloy exhibits a relative 
density of 95 percent and a By, of 2.15 T, 
which makes it applicable for development 
of higher performance and smaller size 
electro-magnetic components. 

2) This sintered alloy was used in the magnetic 
circuit yoke of the print head of a 24-wire- 
dot matrix printer. The printer operated at 
a high printing speed of 110 cps for “‘kanji”’ 
character printing due to the high magnetiza- 
tion of the Fe-50%Co sintered alloy. 
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